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HUMAN TYPE H DIABETES GENE - Kv CHANNEL-INTERACTING 
PROTEIN (KChlPl) LOCATED ON CHROMOSOME 5 

RELATED APPLICATIONS 

This application claims priority to U.S. Provisional Application NO. 
5 60/477,1 1 1 filed June 9, 2003, and to U.S. Provisional Application NO. 60/449,945, 
filed on February 25, 2003, and also to U.S. Provisional Application NO. 60/423,545, 
filed on November 1, 2002, the entire contents of all applications are incorporated 
herein by reference. 

10 BACKGROUND OF THE INVENTION 

Diabetes mellitus, a metabolic disease in which carbohydrate utilization is 
reduced and lipid and protein utilization is enhanced, is caused by an absolute or 
relative deficiency of insulin. In the more severe cases, diabetes is characterized by 
chronic hyperglycemia, glycosuria, water and electrolyte loss, ketoacidosis and coma. 

15 Long term complications include development of neuropathy, retinopathy, 

nephropathy, generalized degenerative changes in large and small blood vessels and 
increased susceptibility to infection. The most common form of diabetes is Type II, 
non-insulin-dependent diabetes that is characterized by hyperglycemia due to 
impaired insulin secretion and insulin resistance in target tissues. Both genetic and 

20 environmental factors contribute to the disease^ For example, obesity plays a major 
role in the development of the disease. Type II diabetes is often a mild form of 
diabetes mellitus of gradual onset. 

The health implications of Type II diabetes are enormous. In 1995, there were 
135 million adults with diabetes worldwide. It is estimated that close to 300 million 

25 will have diabetes in the year 2025. (King H., et al 9 Diabetes Care, 21(9): 1414-143 1 
(1998)). The prevalence of Type II diabetes in the adult population in Iceland is 2.5% 
(Vilbergsson, S., et ah, Diabet Med., 14(6): 491-498 (1997)), which comprises 
approximately 5,000 people over the age of 34 who have the disease. The high 
prevalence of the disease and increasing population affected shows an unmet medical 

30 need to define the genetic factors involved in Type H diabetes to more precisely 

define the associated risk factors. Also needed are therapeutic agents for prevention 
of Type II diabetes. 
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SUMMARY OF THE INVENTION 

As described herein, a locus on chromosome 5q35 has been demonstrated 
which plays a major role in Type II diabetes. The locus, referred to as the Type E 
diabetes locus, comprises a nucleic acid that encodes, KChlPl . 
5 The present invention relates to genes located within the Type II diabetes - 

related locus, particularly nucleic acids comprising the KChlPl gene, and the amino 
acids encoded by these nucleic acids. The invention further relates to pathway 
targeting for drug delivery and diagnosis in identifying those who have Type E 
diabetes and those at risk of developing Type II diabetes. Also described are 
10 haplotypes and SNPs that can be used to identify individuals with Type E diabetes or 
at risk of developing Type H diabetes, particularly in those that are non-obese. As a 
consequence, intervention can be prescribed to these individuals before symptoms of 
the disease present, e.g., dietary changes, exercise and/or medication. Identification 
of genes in the Type II diabetes locus can pave the way for a better understanding of 
15 the disease process, which in turn can lead to improved diagnostics and therapeutics. 

The present invention pertains to methods of diagnosing a susceptibility to 
Type n diabetes in an individual, comprising detecting a polymorphism in a KChlPl 
nucleic acid, wherein the presence of the polymorphism in the nucleic acid is 
indicative of a susceptibility to Type E diabetes. The invention additionally pertains 
20 to methods of diagnosing Type E diabetes in an individual, comprising detecting a 

polymorphism in a KChIP 1 nucleic acid, wherein the presence of the polymorphism ^ 
in the nucleic acid is indicative of Type E diabetes, hi one embodiment, in 
diagnosing Type E diabetes or susceptibility to Type E diabetes by detecting the 
presence of a polymorphism in a KChlPl nucleic acid, the presence of the 
25 polymorphism in the KOiTPl nucleic acid can be indicated, for example, by the 
presence of one or more of the polymorphisms indicated in Table 10. 

In other embodiments, the invention relates to methods of diagnosing a 
susceptibility to Type E diabetes in an individual, comprising detecting an alteration 
in the expression or composition of a polypeptide encoded by a KCbJPl nucleic acid 
30 in a test sample, in comparison with the expression or composition of a polypeptide 
encoded by a KChlPl nucleic acid in a control sample, wherein the presence of an 
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alteration in expression or composition of the polypeptide in the test sample is 
indicative of a susceptibility to Type II diabetes. The invention additionally relates to 
a method of diagnosing Type II diabetes in an individual, comprising detecting an 
alteration in the expression or composition of a polypeptide encoded by a KChlPl 
5 nucleic acid in a test sample, in comparison with the expression or composition of a 
polypeptide encoded by KChlPl nucleic acid in a control sample, wherein the 
presence of an alteration in expression or composition of the polypeptide in the test 
sample is indicative of Type II diabetes. 

The invention also relates to an isolated nucleic acid molecule comprising a 
10 KChlPl nucleic acid (e.g., SEQ ID NO: 1 or the complement of SEQ ID NO: 1). In 
certain embodiments, the KChIP 1 nucleic acid comprises one or more nucleotide 
sequence(s) selected from the group of nucleic acid sequences as shown in Table 10 
(e.g., SEQ ID NOs: 1 14-258) and the complements of the group of nucleic acid 
sequences as shown in Table 10. For example, in certain embodiments, the nucleotide 
15 sequence contains one or more polymorphism^), such as those shown in Table 10. In 
another embodiment, the invention relates to an isolated nucleic acid molecule which 
hybridizes under high stringency conditions to a nucleotide sequence selected from 
the group of SEQ ID NO: 1 and the complement of SEQ ID NO: 1. In certain 
embodiments, the isolated nucleic acid molecule hybridizes under high stringency 
20 conditions to a nucleotide sequence comprising one or more nucleotide sequence(s) 
selected from the group of nucleic acid sequences as shown in Table 10 (e.g. 9 SEQ ID 
NOs: 1 14-258) and the complements of the group of nucleic acid sequences as shown 
in Table 10. For example, in certain embodiments, the nucleotide sequence contains 
one or more polymorphism(s), such as those shown in Table 10. 
25 Also contemplated by the invention is a method of assaying for the presence 

of a first nucleic acid molecule in a sample, comprising contacting said sample with a 
second nucleic acid molecule, where the second nucleic acid molecule comprises at 
least one (or more) nucleic acid sequence(s) selected from the group of SEQ ID NOs: 
1 and 1 14-258, inclusive, wherein the nucleic acid sequence hybridizes to the first 
30 nucleic acid under high stringency conditions. In certain embodiments, the second 
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nucleic acid molecule contains one or more polymorphism^), such as those shown in 
Table 10. 

The invention also relates to a vector comprising an isolated nucleic acid 
molecule of the invention {e.g., SEQ ID NOs: 1 and 1 14-258; optionally including 
5 one or more of the polymorphisms shown in Table 10) operably linked to a regulatory 
sequence, as well as to a recombinant host cell comprising the vector. The invention 
also provides a method for producing a polypeptide encoded by an isolated nucleic 
acid molecule having a polymorphism, comprising culturing the recombinant host cell 
under conditions suitable for expression of the nucleic acid molecule. 
10 Also contemplated by the invention is a method of assaying for the presence 

of a polypeptide encoded by an isolated nucleic acid molecule of the invention in a 
sample, the method comprising contacting the sample with an antibody that 
specifically binds to the encoded polypeptide. 

The invention further pertains to a method of identifying an agent that alters 
15 expression of a KChlPl nucleic acid, comprising: contacting a solution containing a 
nucleic acid comprising the promoter region of the KChlPl gene operably linked to a 
reporter gene, with an agent to be tested; assessing the level of expression of the 
reporter gene in the presence of the agent; and comparing the level of expression of 
the reporter gene in the presence of the agent with a level of expression of the reporter 
20 gene in the absence of the agent; wherein if the level of expression of the reporter 
gene in the presence of the agent differs, by an amount that is statistically significant, 
from the level of expression in the absence of the agent, then the agent is an agent that 
alters expression of the KChlPl gene or nucleic acid. An agent identified by this 
method is also contemplated. 
25 The invention additionally comprises a method of identifying an agent that 

alters expression of a KChlPl nucleic acid, comprising contacting a solution 
containing a nucleic acid of the invention or a derivative or fragment thereof, with an 
agent to be tested; comparing expression of the nucleic acid, derivative or fragment in 
the presence of the agent with expression of the nucleic acid, derivative or fragment in 
30 the absence of the agent; wherein if expression of the nucleic acid, derivative or 
fragment in the presence of the agent differs, by an amount that is statistically 
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significant, from the expression in the absence of the agent, then the agent is an agent 
that alters expression of the KChlPl nucleic acid. In certain embodiments, the 
expression of the nucleic acid, derivative or fragment in the presence of the agent 
comprises expression of one or more splicing variants(s) that differ in kind or in 
5 quantity from the expression of one or more splicing variant(s) the absence of the 
agent. Agents identified by this method are also contemplated. 

Representative agents that alter expression of a KChlPl nucleic acid 
contemplated by the invention include, for example, antisense nucleic acids to a 
KChlPl gene or nucleic acid; a KChlPl gene or nucleic acid; a KChlPl polypeptide; 

10 a KChlPl gene or nucleic acid receptor, or other receptor; a KChlPl binding agent; a 
peptidomimetic; a fusion protein; a prodrug thereof; an antibody; and a ribozyme. A 
method of altering expression of a KChlPl nucleic acid, comprising contacting a cell 
containing a nucleic acid with such an agent is also contemplated. 

The invention further pertains to a method of identifying a polypeptide which 

1 5 interacts with a KChIP 1 polypeptide (e.g. , a KChEP 1 polypeptide encoded by a 
nucleic acid of the invention, such as a nucleic acid comprising one or more 
polymorphism^) indicated in Table 10), comprising employing a yeast two-hybrid 
system using a first vector which comprises a nucleic acid encoding a DNA binding 
domain and a KChlPl polypeptide, splicing variant, or a fragment or derivative 

20 thereof, and a second vector which comprises a nucleic acid encoding a transcription 
activation domain and a nucleic acid encoding a test polypeptide. If transcriptional 
activation occurs in the yeast two-hybrid system, the test polypeptide is a polypeptide, 
which interacts with a KChB? 1 polypeptide. 

In certain methods of the invention, a Type II diabetes therapeutic agent is 

25 used. The Type II diabetes therapeutic agent can be an agent that alters (e.g., 
enhances or inhibits) KChlPl polypeptide activity and/or KChlPl nucleic acid 
expression, as described herein (e.g., a nucleic acid agonist or antagonist). 

Type II diabetes therapeutic agents can alter polypeptide activity or nucleic 
acid expression of a KChlPl nucleic acid by a variety of means, such as, for example, 

30 by providing additional polypeptide or upregulating the transcription or translation of 
the nucleic acid encoding the KChlP l polypeptide; by altering posttranslational 
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processing of the KChlPl polypeptide; by altering transcription of splicing variants; 
or by interfering with polypeptide activity (e.g., by binding to the KChlPl 
polypeptide, or by binding to another polypeptide that interacts with KChlPl, such as 
a KChlPl binding agent as described herein), by altering (e.g., downregulating) the 
expression, transcription or translation of a nucleic acid encoding KChlPl; or by 
altering interaction among KChlPl and a KChlPl binding agent. 

In a further embodiment, the invention relates to Type II diabetes therapeutic 
agent, such as an agent selected from the group consisting of: a KChlPl nucleic acid 
or fragment or derivative thereof; a polypeptide encoded by a KChlPl nucleic acid 
(e.g., encoded by a KChlPl nucleic acid having one or more polymorphisin(s) such as 
those set forth in Table 10); a KChlPl receptor; a KChlPl binding agent; a 
peptidomimetic; a fusion protein; a prodrug; an antibody; an agent that alters KChlPl 
gene or nucleic acid expression; an agent that alters activity of a polypeptide encoded 
by a KChlPl gene or nucleic acid; an agent that alters posttranscriptional processing 
of a polypeptide encoded by a KChlPl gene or nucleic acid; an agent that alters 
interaction of a KChIP 1 polypeptide with a KChIP 1 binding agent or receptor; an 
agent that alters transcription of splicing variants encoded by a KChlPl gene or 
nucleic acid; and ribozymes. The invention also relates to pharmaceutical 
compositions comprising at least one Type II diabetes therapeutic agent as described 
herein. 

The invention also pertains to a method of treating a disease or condition 
associated with a KChlPl polypeptide (e.g., Type II diabetes) in an individual, 
comprising administering a Type II diabetes therapeutic agent to the individual, in a 
therapeutically effective amount In certain embodiments, the Type II diabetes 
therapeutic agent is a KChlPl agonist; in other embodiments, the. Type II diabetes 
therapeutic agent is a KChlPl antagonist. The invention additionally pertains to use 
of a Type II diabetes therapeutic agent as described herein, for the manufacture of a 
medicament for use in the treatment of Type H diabetes, such as by the methods 
described herein. 
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A transgenic animal comprising a nucleic acid selected from the group 
consisting of: an exogenous KChlPl gene or nucleic acid and a nucleic acid encoding 
a KChJPl polypeptide, is further contemplated by the invention. 

In yet another embodiment, the invention relates to a method for assaying a 
5 sample for the presence of a KChlPl nucleic acid, comprising contacting the sample 
with a nucleic acid comprising a contiguous nucleotide sequence which is at least 
partially complementary to a part of the sequence of said KChlPl nucleic acid under 
conditions appropriate for hybridization, and assessing whether hybridization has 
occurred between a KChlPl nucleic acid and said nucleic acid comprising a 
1 0 contiguous nucleotide sequence which is at least partially complementary to a part of 
the sequence of said KChlPl nucleic acid; wherein if hybridization has occurred, a 
KChlPl nucleic acid is present in sample. In certain embodiments, the contiguous 
nucleotide sequence is completely complementary to a part of the sequence of said 
KChIP 1 nucleic acid. If desired, amplification of at least part of said KChlPl nucleic 
15 acid can be performed. 

In certain other embodiments, the contiguous nucleotide sequence is 100 or 
fewer nucleotides in length and is either at least 80% identical to a contiguous 
sequence of nucleotides of one or more of SEQ ID NOs: 1 and 1 14-258; at least 80% 
identical to the complement of a contiguous sequence of nucleotides of one or more of 
20 SEQ ID NOs: 1 and 1 14-258; or capable of selectively hybridizing to said KChlPl 
nucleic acid. 

In other embodiments, the invention relates to a reagent for assaying a sample 
for the presence of a KChlPl gene or nucleic acid, the reagent comprising a 
contiguous nucleotide sequence which is at least partially complementary to a part of 

25 the nucleic acid sequence of said KChIP 1 gene or nucleic acid; or comprising a 

contiguous nucleotide sequence which is completely complementary to a part of the 
nucleic acid sequence of said KChlPl gene or nucleic acid. Also contemplated by the 
invention is a reagent kit, e.g., for assaying a sample for the presence of a KChlPl 
nucleic acid, comprising (e.g., in separate containers) one or more labeled nucleic 

30 acids comprising a contiguous nucleotide sequence which is at least partially 

complementary to a part of the nucleic acid sequence of the KChlPl nucleic acid, and 
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reagents for detection of said label* In certain embodiments, the labeled nucleic acid 
comprises a contiguous nucleotide sequence that is completely complementary to a 
part of the nucleotide sequence of said KChlPl gene or nucleic acid. In other 
embodiments, the labeled nucleic acid can comprise a contiguous nucleotide sequence 
5 which is at least partially complementary to a part of the nucleotide sequence of said 
KChlPl gene or nucleic acid, and which is capable of acting as a primer for said 
KChlPl nucleic acid when maintained under conditions for primer extension. 

The invention also provides for the use of a nucleic acid which is 100 or fewer 
nucleotides in length and which is either: a) at least 80% identical to a contiguous 
10 sequence of nucleotides of one or more of SEQ ID NOs: 1 and 1 14-258; b) at least 
80% identical to the complement of a contiguous sequence of nucleotides of one or 
more of SEQ ID NOs: 1 and 1 14-258; or c) capable of selectively hybridizing to said 
KChlPl nucleic acid, for assaying a sample for the presence of a KChlPl nucleic 
acid. 

15 In yet another embodiment, the use of a first nucleic acid which is 1 00 or 

fewer nucleotides in length and which is either: a) at least 80% identical to a 
contiguous sequence of nucleotides of one or more of SEQ ID NOs: 1 and 1 14-258; 
b) at least 80% identical to the complement of a contiguous sequence of nucleotides 
of one or more of SEQ ID NOs: 1 and 114-258; or c) capable of selectively 

20 hybridizing to said KChlPl nucleic acid; for assaying a sample for the presence of a 
KChlPl gene or nucleic acid that has at least one nucleotide difference from the first 
• nucleic acid (e.g. , a SNTP as set forth in Table 1 0), such as for diagnosing a 
susceptibility to a disease or condition associated with a KChlPl . 

The invention also relates to a method of diagnosing Type II diabetes or a 

25 susceptibility to Type II diabetes in an individual, comprising determining the 

presence or absence in the individual of certain "haplotypes" (combinations of genetic 
markers). In one aspect of the invention of diagnosising a susceptibility of the 
disease, methods are described comprising screening for one of the at-risk haplo types 
in the KChlPl gene that is more frequently present in an individual susceptible to 

30 Type II diabetes, compared to the frequency of its presence in the general population, 
wherein the presence of an at-risk haplotype is indicative of a susceptibility to Type II 
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diabetes. An "at-risk haplotype" is intended to embrace one or a combination of 
haplotypes described herein over the KChlPl gene that show high correlation to Type 
II diabetes. In one embodiment, the at-risk haplotype is characterized by the presence 
of at least one single nucleotide polymorphisms as described in Table 13. In one 
5 embodiment, a haplotype associated with Type II diabetes or a susceptibility to Type 
H diabetes comprises one or more haplotypes identified in Table 2 (haplotypes 
identified as Al, A2, A3, A4, A5, A6, Bl, B2, B3, B4 and B5) or Table 5 (haplotypes 
identified as Dl, D2, D3, D4 and D5). hi certain embodiments, a haplotype 
associated with Type II diabetes or a susceptibility to Type II diabetes comprises 
10 markers DG5S879, DG5S881, D5S2075, DG5S883 and DG5S38 at the 5q35 locus; 
or DG5S1058 andDG5S37 at the 5q35 locus; or DG5S1058, DG5S37 and 
DG5S101 at the 5q35 locus; or DG5S881, DG5S1058, D5S2075, DG5S883 and 
DG5S38 at the 5q35 locus; or DG5S879, DG5S1058 and DG5S37; orDG5S881, 
D5S2075, DG5S883 and DG5S38 at the 5q35 locus; DG5S953, DG5S955, DG5S13 
15 and DG5S959 at the 5q35 locus; or DG5S888 and DG5S953 at the 5q35 locus; or 
DG5S953, DG5S955 and DG5S124 at the 5q35 locus; or DG5S888, DG5S44 and 
DG5S953 at the 5q35 locus; or DG5S953, DG5S955, DG5S13, DG5S123, and 
DG5S959 at the 5q35 locus. The presence of the haplotype is diagnostic of Type II 
diabetes or of a susceptibility to Type II diabetes. Also described herein is a 
20 haplotype associated with Type II diabetes or a susceptibility to Type II diabetes 
comprising markers DG5S13, KCP_1 152, and D5S625 at the 5q35 locus; the 
presence of the haplotype is diagnostic of Type E diabetes or of a susceptibility to 
Type II diabetes. In one particular embodiment, the presence of the -4, 1, 0 haplotype 
at DG5S13, KCP_1 152, andD5S625 is diagnostic of Type H diabetes or of a 
25 susceptibility to Type II diabetes. In another embodiment, a haplotype associated 
with Type II diabetes or a susceptibility to Type II diabetes in an individual, 
comprises markers DG5S124, KCPJ 152, KCPJ2649, KPC 4976 and KPC-16152 at 
the 5q35 locus. In one particular embodiment, the presence of the 0, 1, 1, 3 and 0 
haplotype at DG5S124, KCP_1152, KCPJ2649, KPC_4976 and KPC-16152 is 
30 diagnostic of Type II diabetes or of a susceptibility to Type H diabetes. In another 
embodiment, a haplotype associated with Type II diabetes or a susceptibility to Type 
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II diabetes in an individual, comprises markers KCP_1 73982, KCPJ5400, and 
KCP_18069. In one particular embodiment, the presence of the 0, 1, 1 haplotype at 
KCP_1 73982, KCP_1 5400, and KC3M8069 is diagnostic of Type H diabetes or of a 
susceptibility to Type IT diabetes. 
5 In additional embodiments, a haplotype associated with Type II diabetes or a 

susceptibility to Tj^pe II diabetes comprises markers DG5S124, KCP_1 152, 
KCP 2649, KCP_4976, and KCP_16152 at the 5q35 locus, as well as one of the 
following 3 markers: KCP_197678, KCP_197775, and KCP_202795 at the 5q35 
locus; the presence of the haplotype is diagnostic of Type n diabetes or of a 

10 susceptibility to Type II diabetes. In particular embodiments, the presence of the 0, 3, 
1, 1, 3, 0 haplotype at DG5S124, KCP_197679, KCP_1 152, KCP_2649, KCP_4976, 
and KCP_16152; the presence of the 0, 3, 1, 1, 3, 0 haplotype at DG5S124, 
KCP_197775, KCP_1152, KCP__2649, KCP_4976, and KCP_16152; or the presence 
of the 0, 1, 1, 1, 3, 0 haplotype at DG5S124, KCP_202795, KCP_1 152, KCP_2649, 

15 KCP_4976, and KCP_J6152; is diagnostic of Type II diabetes or of a susceptibility to 
Type II diabetes. 

The presence or absence of the haplotype can be determined by various 
methods, including, for example, using enzymatic amplification of nucleic acid from 
the individual, electrophoretic analysis, restriction fragment length polymorphism 

20 analysis and/or sequence analysis. 

Also described herein is a method of diagnosing Type II diabetes in an 
individual, comprising determining the presence or absence in the individual of a 
haplotype comprising one or more markers and/or single nucleotide polymorphisms 
as shown in Table 10, Table 2, Table 5 and/or Table 13 in the locus on chromosome 

25 5q35, wherein the presence of the haplotype is diagnostic of Type II diabetes. Also 
contemplated is a method of diagnosing a susceptibility to Type II diabetes in an 
individual, comprising determining the presence or absence in the individual of a 
haplotype comprising one or more markers and/or single nucleotide polymorphisms 
as shown in Table 10 and/or Table 13 in the locus on chromosome 5q35, wherein the 

30 presence of the haplotype is diagnostic of a susceptibility to Type II diabetes. 
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A method for the diagnosis and identification of a susceptibility to Type II 
diabetes in an individual is also described, comprising: screening for an at-risk 
haplotype in the KChlPl nucleic acid that is more frequently present in an individual 
susceptible to Type II diabetes compared to an individual who is not susceptible to 
5 Type H diabetes, wherein the at-risk haplotype increases the risk significantly. In 
certain embodiments, the significant increase is at least about 20% or the significant 
increase is identified as an odds ratio of at least about 1 .2. 

A major application of the current invention involves prediction of those at 
higher risk of developing a Type II diabetes. Diagnostic tests that define genetic 

10 factors contributing to Type II diabetes might be used together with or independent of 
the known clinical risk factors to define an individual's risk relative to the general 
population. Better means for identifying those individuals at risk for Type E diabetes 
should lead to better prophylactic and treatment regimens, including more aggressive 
management of the current clinical risk factors. 

15 Another application of the current invention is the specific identification of a 

rate-limiting pathway involved in Type II diabetes. A disease gene with genetic 
variation that is significantly more common in diabetic patients as compared to 
controls represents a specifically validated causative step in the pathogenesis of Type 
II diabetes. That is, the uncertainty about whether a gene is causative or simply 

20 reactive to the disease process is eliminated. The protein encoded by the disease gene 
defines a rate-limiting molecular pathway involved in the biological process of Type 
II diabetes predisposition. The proteins encoded by such Type II genes or its 
interacting proteins in its molecular pathway may represent drug targets that may be 
selectively modulated by small molecule, protein, antibody, or nucleic acid therapies. 

25 Such specific information is greatly needed since the population affected with Type II 
diabetes is growing. 

A third application of the current invention is its use to predict an individual's 
response to a particular drug, even drugs that do not act on KChlPl or its pathway. It 
is a well-known phenomenon that in general, patients do not respond equally to the 
30 same drug. Much of the differences in drug response to a given drug is thought to be 
based on genetic and protein differences among individuals in certain genes and their 
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corresponding pathways. Our invention defines the association of KChlPl with Type 
II diabetes. Some current or future therapeutic agents may be able to affect this gene 
directly or indirectly and therefore, be effective in those patients whose Type H 
diabetes risk is in part determined by the KChlPl genetic variation. On the other 
5 hand, those same drugs may be less effective or ineffective in those patients who do 
not have at risk variation in the KChIP 1 gene. Therefore, KChTP 1 variation or 
haplotypes may be used as a pharmacogenomic diagnostic to predict drag response 
and guide choice of therapeutic agent in a given individual. 

10 BRIEF DESCRIPTION OF THE DRAWINGS . 

The foregoing and other objects, features and advantages of the invention will 

be apparent from the following more particular description of preferred embodiments 

of the invention, as illustrated in the accompanying drawings. 

FIG. 1.1 through 1.148 show the KChlPl genomic DNA (SEQ ID NO: 1). 
15 This sequence is taken from NCBI Build 33. The numbering in FIG. 1, as well as the 

"start" and "end" numbers in all Tables refer to the location in Chromosome 5 in 

NCBI Build 33. The numbering in FIG. 1 refers to the last base in the line 

immediately preceding the number; the numbers are in decreasing order because of 

the "reverse orientation" of the gene. 
20 FIG. 2 shows the amino acid sequence of KChIP 1 as published by An et al 

Nature, 403(6768): 553-6 (2000) (SEQ ID NO: 2). 

FIG. 3 shows the nucleic acid sequence (SEQ ID NO: 3) encoding the amino 

acid sequence of KChlPl as published by An et al .Nature, 403(6768): 553-6 (2000) 

(SEQ ID NO: 2). 

25 FIG. 4 is a series of graphs showing the results of a genome-wide scan using 

906 microsatellite markers. Results are shown for three phenotypes: all Type H 
diabetics (solid lines), obese Type II diabetics (dotted lines) and non-obese Type II 
diabetics (dashed lines). The multipoint allele-sharing LOD-score is on the vertical 
axis, and the centimbrgan distance from the P-terminus of the chromosome is on the 

30 horizontal axis. 



_2004041 1 93A2_I_> 



WO 2004/041193 



PCT/US2003/034681 



-13- 

FIG. 5 graphically depicts the multipoint allele-sharing LOD-score of the 
locus on chromosome 5 after 38 microsatellite markers have been added to the 
framework set in a 40-cM interval, from 160 cM to 200 cM. Results are shown for 
the same three phenotypes as in FIG. 4; all Type II diabetics (solid line), non-obese 
5 Type II diabetics(dashed line) and obese Type II diabetics (dotted line).the results of 
a genome-wide scan using 906 microsatellite markers. 

FIG. 6 graphically depicts the single-marker and haplotype association within 
the 1-LOD-drop for 590 non-obese diabetics vs 477 unrelated population controls. 
The location of the markers and haplotypes is on the horizontal axis and the 
10 corresponding two-sided P-value on the vertical axis. All haplotypes with a P- value 
less than 0.01 are shown. The horizontal bars indicate the span of the corresponding 
haplotypes and the marker density is shown at the bottom of the figure. All locations 
refer to NCBI Build 33 and the 1-LOD-drop spans from 167.64 to 171.28 Mb. 

FIG. 7 schematically shows the location of genes and markers in region B. 
15 The microsatellites used in the locus-wide association study are shown as filled circles 
at the top. The filled boxes indicate the locations of exons, or clusters of exons, for 
KCIDP1. The shaded boxes indicated the location and size of the neighboring genes, 
LCP2, KCNMB 1 , GABRP and RANBP 1 7/ and the grey horizontal lines indicate the 
span of the five most significant microsatellite haplotypes in the region. 

20 

DETAILED DESCRIPTION OF THE INVENTION 

Extensive genealogical information for a population with population-based 
lists of patients with Type II diabetes has been combined with powerful gene sharing 
methods to map a locus on chromosome 5q35. Diabetics and their relatives were 
25 genotyped with a genome- wide marker set including 906 microsatellite markers, with 
an average marker density of 4cM. Due to the role obesity plays in the development 
of diabetes, the material was fractionated according to body mass index (BMI). 
Presented herein are results of a genome wide search of genes that cause Type II 
diabetes in Iceland. 

30 
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Loci Associated with Diabetes 

Evidence for genes causing the early onset monogenic form of diabetes have 
been previously identified. Mutations in six genes have been discovered that cause 
MODY, or maturity onset diabetes of the young. MODY1 - MODY6 are due to 
5 mutations in HNF4a, glucokinase, HNFla, IPF1, HNFlb andNEURODl (MODY1: 
Yamagata K, et al. Nature 384:458-460 (1996); MODY2: Froguel P, F et al, 
Nature 356: 162-164(1992); MODY3: Yamagata, K. 5 et al, Nature 384: 455-458 
(1996); MODY4: Yoshioka M., era/., Diabetes May,46(5):887-94 (1997)MODY5: 
, Horikawa, Y., et al, Nat. Genet. 17: 384-385 (1997) MODY6: Kristinsson S.Y., et 

10 al, Diabetologia Nov:44(ll):2098-103 (2001)). 

One gene has been identified as a disease gene that contributes to the late- 
onset form of diabetes, the calpain 10 gene (CAPN10). CAPN10, was identified 
though a genome- wide screen of Mexican American sibpairs with diabetes 
(Horikawa, Y, et al, Nat. Genet. 26(2) 163-175(2000)). The risk allele has been 

15 shown to be associated with impaired regulation of glucose-induced secretion and 

decreased rate of insulin-stimulated glucose disposal (Lynn, S., etal, Diabetes, 51(1): 
247-250 (2002); Sreenan, S.K., et al, Diabetes 50(9) 2013-2020 (2001) and Baier, L. 
J., et al, J. Clin. Invest. 106(7) R69-73 (2000)). 

Many genome-wide screens in a variety of populations have been p erformed 

20 that have resulted in major loci for Diabetes, Loci are reported on chromosome 2q37 
(Hanis, C.L., etal, Nat. Genet., 13(2):161-166 (1996)), chromosome 15q21 (Cox, et 
al, Nat. Genet. 21(2):213-215 (1999)), chromosome 10q26 (Duggirala, R., et al, Am. 
J. Hum. Genet., 68(5): 1149-1 164 (2001)), chromosome 3p (Ehm, M.G., et al, Am. J. 
Hum. Genet, 66(6):1871-1881 (2000)) in Mexican Americans, and chromosomes 

25 lq21-23 and 1 Iq23-q25 (Hanson R. L. et al, Am J. Hum Genet., 63(4):1 130-1 138 
(1998)) in PIMA Indians. In the Caucasian population, linkages have been observed 
to chromosome 12q24 in Finns (Mahtani, et al, Nat. Genet., 14(l):90-4 (1994)), 
chromosome Iq21-q23 in Americans in Utah (Elbein, S.C., et al, Diabetes, 
48(5): 1 1 75-1 1 82 (1999)), chromosome 3q27-pter in French families (Vionnet, N;, et 

30 al, Am. J. Hum. Genet. 67(6):1470-80 (2000) and chromosome 18pll in 

Scandinavians (Parker, A, etal, Diabetes, 50(3) 675-680 (2001)). A recent study 
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reported a major locus in indigenous Australians on chromosome 2q24.3 (Busfield, 
F,. et al, Am. J. Hum. Genet, 70(2): 349-357 (2002)). Many other studies have 
resulted in suggestive loci or have replicated these loci. 

Association studies have been reported for Type II diabetes. Most of these 
5 studies show modest association to the disease in a group of people but do not account 
for the disease. Altshuler et al, reviewed the association work that has been done and 
concluded that association to only one of 16 genes revealed held up to scrutiny. 
Altshuler et al, confirmed that the Pro 12 Ala polymorphism in PPARg is associated 
with Type II diabetes. Until now, there have been no linkage studies in Type II 
10 diabetes linking the disease to chromosome 5q35 

KChlPl 

The invention described herein has linked Type II diabetes to a gene encoding 
Kv channel-interacting protein 1 (KChlPl; also known as KCNIP1). In the brain and 

15 heart, rapidly inactivating (A-type) voltage-gated potassium (Kv) currents operate at 
subthreshold membrane potentials to control the excitability of neurons and cardiac 
myocytes. Although pore-forming alpha-subunits of the Kv4, or Shal-related, channel 
family form A-Type currents in heterologous cells, these differ significantly from 
native A-Type currents. To identify proteins that interacted with the Kv4 subunit, An 

20 et al, ("Modulation of A-Type potassium channels by a family of calcium sensors" 
Nature 403:553-6 (2000)) used the yeast two-hybrid system with the intracellular 
amino terminus of the rat Kv4.3 subunit to screen rat midbrain cDNA libraries. Two 
Kv channel-interacting proteins were identified and called KChlPs (KChIP-1 and 
KChIP2). Library screening and database mining identified mouse and human 

25 orthologs of these genes. The KChlPl cDNA encodes a 216-amino acid protein. The 
KChlPs have 4 EF-hand-like domains and bind calcium ions. Both KChlPs have 
distinct N termini but share approximately 70% amino acid identity throughout a 
carboxy-terminal 185-amino acid core domain that contains the 4 EF-hand-like 
motifs. Although the KChlPs have around 40% amino acid similarity to neuronal 

30 calcium sensor- 1 and are members of the recoverin /NCS subfamily of calcium- 
binding proteins, other members of this subfamily, such as hippocalcin, did not 
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interact with Kv4 channels in the yeast 2-hybrid assay. An et aL, {supra) additionally 
found that expression of KChlPs and Kv4 together reconstitutes several features of 
native A-Type currents by modulating the density, inactivation kinetics, and rate of 
recovery from inactivation of Kv4 channels in heterologous cells. Both KChlPs 
5 colocalize and coimmunoprecipitate with brain Kv4 alpha-subunits, and are thus 

integral components of native Kv4 channel complexes. As the activity and density of 
neuronal A-Type currents tightly control responses to excitatory synaptic inputs, these 
KChlPs may regulate A-Type currents, and hence neuronal excitability, in response to 
changes in intracellular calcium. 

10 The glycosphingo lipid sulfatide is present in secretory granules and at the 

surface of pancreatic (3-cells (Buschard K, Fredman P. "Sulphatide as an antigen in 
diabetes mellitus". Diabetes fNutr Metab 4:221 -228 (1996)), and antisulfatide 
antibodies (ASA; IgGl) are found in serum from the majority of patients with newly 
diagnosed Type I diabetes. Buschard et aL, ("Sulfatide controls insulin secretion by 

15 modulation of ATP-sensitive K(+)-channel activity and Ca(24-)-depehdent exocytosis 
in rat pancreatic beta-cells" Diabetes 51:2514-21 (2002)) demonstrated that sulfatide 
produced a glucose- and concentration-dependent inhibition of insulin release from 
isolated rat pancreatic islets. This inhibition of insulin secretion was due to activation 
of ATP-sensitive K + -(K A tp) channels in single rat p-cells. No effect of sulfatide was 

20 observed on whole-cell Ca 2+ -channel activity or glucose-induced elevation of 
cytoplasmic Ca 2+ concentration. A key observation was that sulfatide stimulated 
Ca 2+ -dependent exocytosis determined by capacitance measurements and depolarized- 
induced insulin secretion from islets exposed to diazoxide and high external KC1. The 
monoclonal sulfatide antibody Sulph I as well as ASA-positive serum reduced 

25 glucose-induced insulin secretion by inhibition of Ca 2+ -dependent exocytosis. This . 
suggests that sulfatide is important for the control of glucose-induced insulin secretion 
and that both an increase and a decrease in the sulfatide content have an impact on the 
secretory capacity of the individual p-cells. 
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ASSESSMENT FOR AT-RISK HAPLOTYPES 

A "haplotype," as described herein, refers to a combination of genetic markers 
("alleles"), such as those set forth in Table 2 and Table 5. In a certain embodiment, 
the haplotype can comprise one or more alleles, two or more alleles, three or more 
5 alleles, four or more alleles, or five or more alleles. The genetic markers are 

particular "alleles" at "polymorphic sites" associated with KChPIl . A nucleotide 
position at which more than one sequence is possible in a population (either a natural 
population or a synthetic population, e.g., a library of synthetic molecules) is referred 
to herein as a "polymorphic site". Where a polymorphic site is a single nucleotide in 
0 length, the site is referred to as a single nucleotide polymorphism ("SNP"). For 

example, if at a particular chromosomal location, one member of a population has an 
adenine and another member of the population has a thymine at the same position, 
then this position is a polymorphic site, and, more specifically, the polymorphic site is 
a SNP. Polymorphic sites can allow for differences in sequences based on 
5 substitutions, insertions or deletions, Each version of the sequence with respect to the 
polymorphic site is referred to herein as an "allele" of the polymorphic site. Thus, in 
the previous example, the SNP allows for both an adenine allele and a thymine allele. 

Typically, a reference sequence is referred to for a particular sequence. 
Alleles that differ from the reference are referred to as "variant" alleles. For example, 
the reference KChPIl sequence is described herein by SEQ ID NO: 1 The term, 
"variant KChPIl", as used herein, refers to a sequence that differs from SEQ ID NO: 
1 but is otherwise substantially similar. The genetic markers that make up the 
haplotypes described herein are KChPIl variants. Additional variants can include 
changes that affect a potypeptide, e.g., the KChPIl polypeptide. These sequence 
differences, when compared to a reference nucleotide sequence, can include the 
insertion or deletion of a single nucleotide, or of more than one nucleotide, resulting 
in a frame shift; the change of at least one nucleotide, resulting in a change in the 
encoded amino acid; the change of at least one nucleotide, resulting in the generation 
of a premature stop codon; the deletion of several nucleotides, resulting in a deletion 
of one or more amino acids encoded by the nucleotides; the, insertion of one or several 
nucleotides, such as by unequal recombination or gene conversion, resulting in an 
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interruption of the coding sequence of a reading frame; duplication of all or a part of a 
sequence; transposition; or a rearrangement of a nucleotide sequence, as described in 
detail above. Such sequence changes alter the polypeptide encoded by a KChPIl 
nucleic acid. For example, if the change in the nucleic acid sequence causes a frame 
5 shift, the frame shift can result in a change in the encoded amino acids, and/or can 
result in the generation of a premature stop codon, causing generation of a truncated 
polypeptide. Alternatively, a polymorphism associated with Type IT diabetes or a 
susceptibility to Type II diabetes can be a synonymous change in one or more 
nucleotides (Le T , a change that does not result in a change in the amino acid 

10 sequence). Such a polymorphism can, for example, alter splice sites, affect the 

stability or transport of mRNA, or otherwise affect the transcription or translation of 
the polypeptide. The polypeptide encoded by the reference nucleotide sequence is the 
"reference" polypeptide with a particular reference amino acid sequence, and 
polypeptides encoded by variant alleles are referred to as "variant" polypeptides with 

15 variant amino acid sequences. 

Haplotypes are a combination of genetic markers, e.g., particular alleles at 
polymorphic sites. The haplotypes described herein, e.g., having markers such as 
those shown in Table 6, Table 7, Table 9, Table 11, Table 12 and Table 13 are found 
more frequently in individuals with Type II diabetes than in individuals without Type 

20 II diabetes. Therefore, these haplotypes have predictive value for detecting Type II 
diabetes or a susceptibility to Type n diabetes in an individual. The haplotypes 
described herein are a combination of various genetic markers, e.g., SNPs and 
microsatellites. Therefore, detecting haplotypes can be accomplished by methods 
known in the art for detecting sequences at polymorphic sites, such as the methods 

25 described above. 

In certain methods described herein, an individual who is at risk for Type II 
diabetes is an individual in whom an at-risk haplotype is identified. In one 
embodiment, the at-risk haplotype is one that confers a significant risk of Type II 
diabetes. In one embodiment, significance associated with a haplotype is measured 

30 by an odds ratio. In a further embodiment, the significance is measured by a 

percentage. In one embodiment, a significant risk is measured as an odds ratio of at 
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least about 1.2, including but not limited to: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8 and 1.9. In 
a further embodiment, an odds ratio of at least 1 .2 is significant. In a further 
embodiment, an odds ratio of at least about 1.5 is significant. In a further 
embodiment, a significant increase in risk is at least about 1.7 is significant. In a 
5 further embodiment, a significant increase in risk is at least about 20%, including but 
not limited to about 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 
80%, 85%, 90%, 95% and 98%. In a further embodiment, a significant increase in 
risk is at least about 50%. It is understood however, that identifying whether a risk is 
medically significant may also depend on a variety of factors, including the specific 
10 disease, the haplotype, and often, environmental factors. 

An at-risk haplotype in, or comprising portions of, the KChPIl gene, is one 
where the haplotype is more frequently present in an individual at risk for Type II 
diabetes (affected), compared to the frequency of its presence in a healthy individual 
(control), and wherein the presence of the haplotype is indicative of Type II diabetes 
15 or susceptibility to Type II diabetes. 

Standard techniques for genotyping for the presence of SNPs and/or 
microsatellite markers can be used, such as fluorescent-based techniques (Chen, et ah, 
Genome Res. P, 492 (1999)), PCR, LCR, Nested PCR and other techniques for nucleic 
acid amplification. In one embodiment, the method comprises assessing in an 
20 individual the presence or frequency of SNPs and/or microsatellites in, comprising 
portions of, the KChlPlgene, wherein an excess or higher frequency of the SNPs 
and/or microsatellites compared to a healthy control individual is indicative that the 
individual has Type II diabetes, or is susceptible to Type II diabetes. See, for 
example, Table 6, Table 7, Table 9, Table 1 1, Table 12 and 13 (below) for SNPs and 
25 markers that can form haplotypes that can be used as screening tools. These markers 
and SNPs can be identified in at-risk haploptypes. For example, an at-risk haplotype 
can include microsatellite markers and/or SNPs such as those set forth in Table 2 and 
Table 5. The presence of the haplotype is indicative a susceptibility to Type II 
diabetes, and therefore is indicative of an individual who falls vritbin a target 
30 population for the treatment methods described herein. 
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NUCLEIC ACro THERAPEUTIC AGENTS 

In another embodiment, a nucleic acid of the invention; a nucleic acid 
complementary to a nucleic acid of the invention; or a portion of such a nucleic acid 
(e.g., an oligonucleotide as described below); or a nucleic acid encoding a KChlPl 
5 polypeptide, can be used in "antisense" therapy, in which a nucleic acid (e.g., an 

oligonucleotide) which specifically hybridizes to the mRNA and/or genomic DNA of 
a nucleic acid is administered or generated in situ. The antisense nucleic acid that 
specifically hybridizes to the mRNA and/or DNA inhibits expression of the 
polypeptide encoded by that mRNA and/or DNA, e.g., by inhibiting translation and/or 
10 transcription. Binding of the antisense nucleic acid can be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, through 
specific interaction in the major groove of the double helix. 

An antisense construct can be delivered, for example, as an expression 
plasmid as described above. When the plasmid is transcribed in the cell, it produces 
15 RNA that is complementary to a portion of the mRNA and/or DNA that encodes a 
KChlPl polypeptide. Alternatively, the antisense construct can be an oligonucleotide 
probe that is generated ex vivo and introduced into cells; it then inhibits expression by 
; hybridizing with the mRNA and/or genomic DNA of the polypeptide. In one 

embodiment, the oligonucleotide probes are modified oligonucleotides that are 
20 resistant to endogenous nucleases, e.g., exonucleases and/or endonucleases, thereby 
rendering them stable in vivo. Exemplary nucleic acid molecules for use as antisense 
oligonucleotides are phosphoramidate, phosphothioate and methylphosphonate 
analogs of DNA (see also U.S. Patent Nos. 5,176,996, 5,264,564 and 5,256,775). 
Additionally, general approaches to constructing oligomers useful in antisense therapy 
25 are also described, for example, by Van der Krol et al. (BioTechniques 6:958-976 
(1988)); and Stein et al. (Cancer Res. 48:2659-2668 (1988)). With respect to 
antisense DNA, oligodeoxyribonucleotides derived from the translation initiation site 
are preferred. 

To perform antisense therapy, oligonucleotides (mRNA, cDNA or DNA) are 
30 designed that are complementary to mRNA encoding the polypeptide. The antisense 
oligonucleotides bind to mRNA transcripts and prevent translation. Absolute 
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complementarity, although preferred, is not required, A sequence "complementary" 
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient 
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the 
case of double-stranded antisense nucleic acids, a single strand of the duplex DNA 
5 may thus be tested, or triplex formation may be assayed. The ability to hybridize will 
depend on both the degree of complementarity and the length of the antisense nucleic 
acid, as described in detail above. Generally, the longer the hybridizing nucleic acid, 
the more base mismatches with an RNA it may contain and still form a stable duplex 
(or triplex, as the case may be). One skilled in the art can ascertain a tolerable degree 

10 of mismatch by use of standard procedures. 

The oligonucleotides used in antisense therapy can be DNA, RNA, or 
chimeric mixtures or derivatives or modified versions thereof, single-stranded or 
double-stranded. The oligonucleotides can be modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule, 

15 hybridization, etc. The oligonucleotides can include other appended groups such as 
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al. 9 Proc. NatL Acad. Set USA 
86:6553-6556 (1989); Lemaitre et aL, Proc. NatL Acad. Set USA 84:648-652 (1987); 
PCT International Publication NO: WO 88/098 1 0) or the blood-brain barrier (see, 

20 e.g. 9 PCT International Publication NO: WO 89/10134), or hybridization-triggered 
cleavage agents (see, e. g. 9 Krol et al. 9 BioTechniques 6:958-976 (1988)) or 
intercalating agents. (See, e.g., Zon, Pharnt.Res. 5: 539-549 (1988)). To this end, the 
oligonucleotide may be conjugated to another molecule (e.g. 9 a peptide, hybridization 
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent). 

25 The antisense molecules are delivered to cells that express a KChJPl 

polypeptide in vivo. A number of methods can be used for delivering antisense DNA 
or RNA to cells; e.g. 9 antisense molecules can be injected directly into the tissue site, 
or modified antisense molecules, designed to target the desired cells (e.g. 9 antisense 
linked to peptides or antibodies that specifically bind receptors or antigens expressed 

30 on the target cell surface) can be administered systematically. Alternatively, in a 

another embodiment, a recombinant DNA construct is utilized in which the antisense 
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oligonucleotide is placed under the control of a strong promoter (e.g. , pol HI or pol 
II). The use of such a construct to transfect target cells in the patient results in the 
transcription of sufficient amounts of single stranded RNAs that will form 
complementary base pairs with the endogenous transcripts and thereby prevent 

5 translation of the mRNA. For example, a vector can be introduced in vivo such that it 
is taken up by a cell and directs the transcription of an antisense RNA. Such a vector 
can remain episomal or become chromosomally integrated, as long as it can be 
transcribed to produce the desired antisense RNA. Such vectors can be constructed 
by recombinant DNA technology methods standard in the art and described above* 

10 For example, a plasmid, cosmid, YAC or viral vector can be used to prepare the 
recombinant DNA construct that can be introduced directly into the tissue site. 
Alternatively, viral vectors can be used which selectively infect the desired tissue, in 
which case administration may be accomplished by another route (e.g. , systemically). 
In another embodiment of the invention, small double-stranded interfering RNA 

15 (RNA interference (RNAi)) can be used. KNAi is a post-transcription process, in 
which double-stranded RNA is introduced, and sequence-specific gene silencing 
results, though catalytic degradation of the targeted mRNA. See, e.g., Elbashir, S.M. 
et ah, Nature 4/7:494-498 (2001); Lee, N.S., Nature Biotech. 79:500-505 (2002); 
Lee, S-K. et al, Nature Medicine 8(7) :68 1-686 (2002); the entire teachings of these 

20 references are incorporated herein by reference. 

Endogenous expression of a gene product can also be reduced by inactivating 
or c Tcnocldng out" the gene or its promoter using targeted homologous recombination 
(e.g., see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 
51:503-512 (1987); Thompson et al, Cell 5:313-321 (1989)). For example, an 

25 altered, non-functional gene (or a completely unrelated DNA sequence) flanked by 
DNA homologous to the endogenous gene (either the coding regions or regulatory 
regions of the gene) can be used, with or without a selectable marker and/or a 
negative selectable marker, to transfect cells that express the gene in vivo. Insertion 
of the DNA construct, via targeted homologous recombination, results in inactivation 

30 of the gene. The recombinant DNA constructs can be directly administered or 
targeted to the required site in vivo using appropriate vectors, as described above. 
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Alternatively, expression of non-altered genes can be increased using a similar 
method: targeted homologous recombination can be used to insert a DNA construct 
comprising a non-altered functional gene, or the complement thereof, or a portion 
thereof, in place of an gene in the cell, as described above. In another embodiment, 
5 targeted homologous recombination can be used to insert a DNA construct 

comprising a nucleic acid that encodes a polypeptide variant that differs from that 
present in the cell. 

Alternatively, endogenous expression of a gene product can be reduced by 
targeting deoxyribonucleotide sequences complementary to the regulatory region (i.e., 

10 the promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the gene in target cells in the body. (See generally, Helene, C, 
Anticancer Drug Des., 6(6):569-84 (1991); Helene, C. etal.,Ann. N.Y. Acad, Set 
660:27-36 (1992); and Maher, L. J., Bioassays 14(12):807-15 (1992)). Likewise, the 
antisense constructs described herein, by antagonizing the normal biological activity 

15 of the gene product, can be used in the manipulation of tissue, e.g. , tissue 

differentiation, both in vivo and for ex vivo tissue cultures. Furthermore, the anti- 
sense techniques (e.g., microinjection of antisense molecules, or transfection with 
plasmids whose transcripts are anti-sense with regard to a nucleic acid RNA or 
nucleic acid sequence) can be used to investigate the role of one or more members of 

20 the KChlPl pathway in the development of disease-related conditions. Such 
techniques can be utilized in cell culture, but can also be used in the creation of 
transgenic animals. 

The therapeutic agents as described herein can be delivered in a composition, 
as described above, or alone. They can be administered systemically, or can be 

25 targeted to a particular tissue. The therapeutic agents can be produced by a variety of - 
means, including chemical synthesis; recombinant production; in vivo production 
(e.g., a transgenic animal, such as U.S. Patent NO: 4,873,316 to Meade et al.'), for 
example, and can be isolated using standard means such as those described herein. In 
addition, a combination of any of the above methods of treatment (e.g., administration 

30 of non-altered polypeptide in conjunction with antisense therapy targeting altered 
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mRNA; administration of a first splicing variant in conjunction with antisense therapy 
targeting a second splicing variant) can also be used. 

The invention additionally pertains to use of such therapeutic agents, as 
described herein, for the manufacture of a medicament for the treatment of Type If 
5 diabetes e.g., using the methods described herein. 

MONITORING PROGRESS OF TREATMENT 

The current invention also pertains to methods of monitoring the effectiveness 
of treatment on the regulation of expression (e.g., relative or absolute expression) of 

1 0 one or more KChJP 1 isoforms at the RNA or protein level or its enzymatic activity. 
KChlPl message or protein or enzymatic activity can be measured in a sample of 
peripheral blood or cells derived therefrom. An assessment of the levels of expression 
or activity can be made before and during treatment with KChlPl therapeutic agents. 
For example, in one embodiment of the invention, an individual who is a member of 

15 the target population can be assessed for response to treatment with a KChlPl 

inhibitor, by examining calcium levels or Kv channel-interacting proteins activity or 
absolute and/or relative levels of KChlPl protein or mRNA isoforms in peripheral 
blood in general or specific cell subtractions or combination of cellsubfractions. In 
addition, variation such as haplotypes ormutations within or near (within 100 to 

20 200kb) of the KChlPl gene may be used to identify individuals who are at higher risk 
for Type II diabetes to increase the power and efficiency of clinical trials for 
pharmaceutical agents to prevent or treat Type II diabetes. The haplotypes and other 
variations may be used to exclude or fractionate patients in a clinical trial who are 
likely to have non- KChIP 1 involvement in their Type II diabetes risk in order to 

25 enrich patients who have other genes or pathways involved and boost the power and 
sensitivity of the clinical trial. Such variation maybe used as a pharmacogenomic test 
to guide selection of pharmaceutical agents for individuals. 

Described herein is the first known linkage study of Type II diabetes showing 
a connection to chromosome 5q35. Based on the linkage studies conducted, a direct 

30 relationship between Type II diabetes and the locus on chromosome 5q35, in 
particular the KChlPl gene, has been discovered. 
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NUCLEIC ACEDS OF THE INVENTION 
KCMPl Nucleic Acids, Portions and Variants 
5 Accordingly, the invention pertains to isolated nucleic acid molecules 

comprising human KChlPl nucleic acid. The term, "KChlPl nucleic acid," as used 
herein, refers to an isolated nucleic acid molecule encoding a KChlPl polypeptide 
(e.g. 9 a KChlPl gene, such as shown in SEQ ID NO:l). The KChlPl nucleic acid 
molecules of the present invention can be RNA, for example, mRNA, or DNA, such 

10 as cDNA and genomic DNA. DNA molecules can be double-stranded or single- 
stranded; single stranded RNA or DNA can be either the coding, or sense, strand or 
the non-coding, or antisense strand. The nucleic acid molecule can include all or a 
portion of the coding sequence of the gene and can further comprise additional non- 
coding sequences such as introns and non-coding 3* and 5' sequences (including 

15 regulatory sequences, for example). 

For example, the KChlPl nucleic acid can the genomic sequence shown in 
FIG. 1, or a portion or fragment of the isolated nucleic acid molecule (e.g., cDNA or 
the gene) that encodes KChlPl polypeptide. In certain embodiments, the isolated 
nucleic acid molecule comprises a nucleic acid molecule selected from the group 

20 consisting of SEQ ID NOs: 1 and 1 14-258 (e.g., in Table 10) or the complement of 
such a nucleic acid molecule. 

Additionally, nucleic acid molecules of the inventioii can be fused to a marker 
sequence, for example, a sequence that encodes a polypeptide to assist in isolation or 
purification of the polypeptide. Such sequences include, but are not limited to, those 

25 that encode a glutathione-S-transferase (GST) fusion protein and those that encode a 
hemagglutinin A (HA) polypeptide marker from influenza. 

An "isolated" nucleic acid molecule, as used herein, is one that is separated 
from nucleic acids that normally flank the gene or nucleotide sequence (as in genomic 
sequences) and/or has been completely or partially purified from other transcribed 

30 sequences (e.g., as in an RNA library). For example, an isolated nucleic acid of the 
invention may be substantially isolated with respect to the complex cellular milieu in 
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which it naturally occurs, or culture medium when produced by recombinant 
techniques, or chemical precursors or other chemicals when chemically synthesized. 
In some instances, the isolated material will form part of a composition (for example, 
a crude extract containing other substances), buffer system or reagent mix. In other 
5 circumstances, the material may be purified to essential homogeneity, for example as 
determined by PAGE or column chromatography such as HPLC. Preferably, an 
isolated nucleic acid molecule comprises at least about 50, 80 or 90% (on a molar 
basis) of all macromolecular species present. With regard to genomic DNA, the term 
"isolated" also can refer to nucleic acid molecules that are separated from the 

10 chromosome with which the genomic DNA is naturally associated. For example, the 
isolated nucleic acid molecule can contain less than about 5 kb but not limited to 4 kb, 
3 kb, 2 kbj 1 kb, 0.5 kb or 0.1 kb of nucleotides which flank the nucleic acid molecule 
in the genomic DNA of the cell from which the nucleic acid molecule is derived. 
The nucleic acid molecule can be fused to other coding or regulatory 

15 sequences and still be considered isolated. Thus, recombinant DNA contained in a 
vector is included in the definition of "isolated" as used herein. Also, isolated nucleic 
acid molecules include recombinant DNA molecules in heterologous host cells, as 
well as partially or substantially purified DNA molecules in solution. "Isolated" 
nucleic acid molecules also encompass in vivo and in vitro RNA transcripts of the 

20 DNA molecules of the present invention. An isolated nucleic acid molecule can 
include a nucleic acid molecule or nucleic acid sequence that is synthesized 
chemically or by recombinant means. Therefore, recombinant DNA contained in a 
vector is included in the definition of "isolated" as used herein. Also, isolated nucleic 
acid molecules include recombinant DNA molecules in heterologous organisms, as 

25 well as partially or substantially purified DNA molecules in solution. In vivo and in 
vitro RNA transcripts of the DNA molecules of the present invention are also 
encompassed by "isolated" nucleic acid sequences. Such isolated nucleic acid 
molecules are useful in the manufacture of the encoded polypeptide, as probes for 
isolating homologous sequences {e.g., from other mammalian species), for gene 

30 mapping (e.g. , by in situ hybridization with chromosomes), or for detecting 
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expression of the gene in tissue (e.g., human tissue), such as by Northern blot 
analysis. 

The present invention also pertains to nucleic acid molecules which are not 

necessarily found in nature but which encode a KChlPl polypeptide, or another 

splicing variant of a KChlPl polypeptide or polymorphic variant thereof. Thus, for 

example, the invention pertains to DNA molecules comprising a sequence that is 

different from the naturally occurring nucleotide sequence but which, due to the 

degeneracy of the genetic code, encode a KChlPl polypeptide of the present 

invention. The invention also encompasses nucleic acid molecules encoding portions 

(fragments), or encoding variant polypeptides such as analogues or derivatives of a 

KChlPl polypeptide. Such variants can be naturally occurring, such as in the case of 

allelic variation or single nucleotide polymorphisms, or non-natoaUy-occurring, such 

as those induced by various mutagens and mutagenic processes. Intended variations 

include, but are not limited to, addition, deletion and substitution of one or more 

nucleotides that can result in conservative or non-conservative amino acid changes 

\ .... 
including additions and deletions. Preferably the nucleotide (and/or resultant amino 

acid) changes are silent or conserved; that is, they do not alter the characteristics or 

activity of a KChlPl polypeptide. In one embodiment, the nucleic acid sequences are 

fragments that comprise one or more polymorphic microsatellite markers. In another 

embodiment, the nucleotide sequences are fragments that comprise one or more single 

nucleotide polymorphisms in a KChlPl gene. 

Other alterations of the nucleic acid molecules of the invention can include, 

for example, labeling, methylation, internucleotide modifications such as uncharged 

linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates), 

charged linkages (e.g., phosphorothioates, phosphorodithioates), pendent moieties 

(e.g., polypeptides), intercalators (e.g., acridine, psoralen), chelators, alkylators, and 

modified linkages (e.g., alpha anomeric nucleic acids). Also included are synthetic 

molecules that mimic nucleic acid molecules in the ability to bind to a designated 

sequence via hydrogen bonding and other chemical interactions. Such molecules 

include, for example, those in which peptide linkages substitute for phosphate 

linkages in the backbone of the molecule. 
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The invention also pertains to nucleic acid molecules that hybridize under high 
stringency hybridization conditions, such as for selective hybridization, to a 
nucleotide sequence described herein (e.g., nucleic acid molecules which specifically 
hybridize to a nucleotide sequence encoding polypeptides described herein, and, 
5 optionally, have an activity of the polypeptide). In one embodiment, the invention 
includes variants described herein which hybridize under high stringency 
hybridization conditions (eg-., for selective hybridization) to a nucleotide sequence 
comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 
1 14-258. In another embodiment, the invention includes variants described herein 

10 that hybridize under high stringency hybridization conditions (e.g., for selective 
hybridization) to a nucleotide sequence encoding an amino acid sequence or a 
polymorphic variant thereof. In another embodiment, the variant that hybridizes 
under high stringency hybridizations has an activity of a KChlP 1 polypeptide. 
Such nucleic acid molecules can be detected and/or isolated by specific 

15 hybridization (e.g., under high stringency conditions). "Specific hybridization/' as 
used herein, refers to the ability of a first nucleic acid to hybridize to a second nucleic 
acid in a manner such that the first nucleic acid does not hybridize to any nucleic acid 
other than to the second nucleic acid (e.g., when the first nucleic acid has a higher 
similarity to the second nucleic acid than to any other nucleic acid in a sample 

20 wherein the hybridization is to be performed). "Stringency conditions" for 

hybridization is a term of art which refers to the incubation and wash conditions, e.g., 
conditions of temperature and buffer concentration, which permit hybridization of a 
particular nucleic acid to a second nucleic acid; the first nucleic acid may be perfectly 
(i.e., 100%) complementary to the second, or the first and second may share some 

25 degree of complementarity which is less than perfect (e.g., 70%, 75%, 85%, 90%, 

95%). For example, certain high stringency conditions can be used which distinguish 
perfectly complementary nucleic acids from those of less complementarity. "High 
stringency conditions", "moderate stringency conditions" and "low stringency 
conditions" for nucleic acid hybridizations are explained on pages 2.10.1-2.10.16 and 

30 pages 6.3.1-6.3.6 in Current Protocols 'in Molecular Biology (Ausubel, F.M. et ah, 
"Cun-ent Protocols in Molecular Biology'", John Wiley & Sons, (2001)), the entire 
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teachings of which are incorporated by reference herein). The exact conditions which 
determine the stringency of hybridization depend not only on ionic strength (e.g. 
0.2X SSC, 0.1X SSC), temperature (e.g., room temperature, 42°C, 68°C) and the 
concentration of destabilizing agents such as formamide or denaturing agents such as 
5 SDS, but also on factors such as the length of the nucleic acid sequence, base 

composition, percent mismatch between hybridizing sequences and the frequency of 
occurrence of subsets of that sequence within other non-identical sequences. Thus, 
equivalent conditions can be determined by varying one or more of these parameters 
while maintaining a similar degree of identity or similarity between the two nucleic 

10 acid molecules. Typically, conditions are used such that sequences at least about 

60%, at least about 70%, at least about 80%, at least about 90%, or at least about 95% 
or more identical to each other remain hybridized to one another. By varying 
hybridization conditions from a level of stringency at which no hybridization occurs 
to a level at which hybridization is first observed, conditions which will allow a given 

15 sequence to hybridize (e.g., selectively) with the most similar sequences in the sample 
can be determined. 

Exemplary conditions are described in Krause, M.H. and S,A. Aaronson, 
Methods in Enzymology 200:546-556 (1991), and in, Ausubel, et al 9 "Current 
Protocols in Molecular Biology", John Wiley & Sons, (2001), which describes the 

20 determination of washing conditions for moderate or low stringency conditions. 

Washing is the step in which conditions are usually set so as to determine a minimum 
level of complementarity of the hybrids. Generally, starting from the lowest 
temperature at which only homologous hybridization occurs, each °C by which the 
final wash temperature is reduced (holding SSC concentration constant) allows an 

25 increase by 1% in the maximum extent of mismatching among the sequences that 
hybridize. Generally, doubling the concentration of SSC results in an increase in T m 
of -17°C. Using these guidelines, the washing temperature can be determined 
empirically for high, moderate or low stringency, depending on the level of mismatch 
sought. 

30 For example, a low stringency wash can comprise washing in a solution 

containing 0.2X SSC/0.1% SDS for 10 minutes at room temperature; a moderate 
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stringency wash can comprise washing in a pre-waimed solution (42°C) solution 
containing 0.2X SSC/0. 1 % SDS for 1 5 minutes at 42°C; and a high stringency wash 
can comprise washing in pre-warmed (68°C) solution containing 0.1X SSC/0. 1%SDS 
for 15 minutes at 68°C. Furthermore, washes can be performed repeatedly or 
5 sequentially to obtain a desired result as known in the art. Equivalent conditions can 
be determined by varying one or more of the parameters given as an example, as 
known in the art, while maintaining a similar degree of identity or similarity between 
the target nucleic acid molecule and the primer or probe used. 

The percent homology or identity of two nucleotide or amino acid sequences 

10 can be determined by aligning the sequences for optimal comparison purposes {e.g. , 
gaps can be introduced in the sequence of a first sequence for optimal alignment). 
The nucleotides or amino acids at corresponding positions are then compared, and the 
percent identity between the two sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # of identical positions/total # of 

15 positions x 100). When a position in one sequence is occupied by the same nucleotide 
or amino acid residue as the corresponding position in the other sequence, then the 
molecules are homologous at that position. As used herein, nucleic acid or amino 
acid "homology" is equivalent to nucleic acid or amino acid "identity". In certain 
embodiments, the length of a sequence aligned for comparison purposes is at least 

20 30%, for example, at least 40%, in certain embodiments at least 60%, and in other 

embodiments at least 70%, 80%, 90% or 95% of the length of the reference sequence. 
The actual comparison of the two sequences can be accomplished by well-known 
methods, for example, using a mathematical algorithm. A preferred, non-limiting 
example of such a mathematical algorithm is described in KarUn et al, Proc. Natl 

25 Acad. Set USA 90:5873-5877 (1993). Such an algorithm is incorporated into the 
NBLAST and XBLAST programs (version 2.0) as described in Aitschul et al 9 
Nucleic Acids Res. 25:389-3402 (1997). When utilizing BLAST and Gapped BLAST 
programs, the default parameters of the respective programs (e.g., NBLAST) can be 
used. In one embodiment, parameters for sequence comparison can be set at 

30 score=100, wordlength=12, or can be varied (e.g., W=5 or W=20). 
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Aiiother prefen-ed, non-limiting example of a mathematical algorithm utilized 
for the comparison of sequences is the algorithm of Myers and Miller, CABIOS 4(1): 
1 1-17 (1988). Such an algorithm is incorporated into the ALIGN program (version 
2.0) which is part of the GCG sequence alignment software package (Accelrys, 
5 Cambridge, UK). When utilizing the ALIGN program for comparing amino acid 
sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 can be used. Additional algorithms for sequence analysis are known in 
the art and include ADVANCE and ADAM as described in Torellis and Robotti, 
Comput Appl. BioscL 10:3-5 (1994); and FASTA described in Pearson and Lipman, 

10 Proc. Natl. Acad. Set USA 85:2444-8 (1988). 

In another embodiment, the percent identity between two amino acid 
sequences can be accomplished using the GAP program in the GCG software package 
using either a BLOSUM63 matrix or a PAM250 matrix, and a gap weight of 12, 1 0, 8, 
6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent 

15 identity between two nucleic acid sequences can be accomplished using the GAP 
program in the GCG software package using a gap weight of 50 and a length weight 
of 3. 

The present invention also provides isolated nucleic acid molecules that 
contain a fragment or portion that hybridizes under highly stringent conditions to a 

20 nucleotide sequence comprising a nucleotide sequence selected from the group . 
consisting of SEQ ID NOs: 1, 1 14-258, or the comple ment of s uch a s equence, and 
also provides isolated nucleic acid molecules that contain a fragment or portion that 
hybridizes under highly stringent conditions to a nucleotide sequence encoding an 
amino acid sequence or polymorphic variant thereof. The nucleic acid fragments of 

25 the invention are at least about 15, preferably at least about 1 8, 20, 23 or 25 

nucleotides, and can be 30, 40, 50, 100, 200 or more nucleotides in length. Longer 
fragments, for example, 30 or more nucleotides in length, that encode antigenic 
polypeptides described herein are particularly useful, such as for the generation of 
antibodies as described below. 
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Probes and Primers * 

In a related aspect, the nucleic acid fragments of the invention are used as 
probes or primers in assays such as those described herein. "Probes" or "primers" are 
oligonucleotides that hybridize in a base-specific manner to a complementary strand 
5 of nucleic acid molecules. Such probes and primers include polypeptide nucleic 
acids, as described in Nielsen et al 9 Science 254:1497-1500 (1991). 

A probe or primer comprises a region of nucleotide sequence that hybridizes 
to at least about 15, for example about 20-25, and in certain embodiments about 40, 
50 or 75, consecutive nucleotides of a 'nucleic acid molecule comprising a contiguous 

10 nucleotide sequence selected from the group consisting of SEQ ID NOs: 1, 1 14-258 
or polymorphic variant thereof. In other embodiments, a probe or primer comprises 
100 or fewer nucleotides, in certain embodiments from 6 to 50 nucleotides, for 
example from 12 to 30 nucleotides. In other embodiments, the probe or primer is at 
least 70% identical to the contiguous nucleotide sequence or to the complement of the 

15 contiguous nucleotide sequence, for example at least 80% identical, in certain 

embodiments at least 90% identical, and in other embodiments at least 95% identical, 
or even capable of selectively hybridizing to the contiguous nucleotide sequence or to 
the complement of the contiguous nucleotide sequence. Often, the probe or primer 
further comprises a label, e.g., radioisotope, fluorescent compound, enzyme, or 

20 enzyme co-factor. 

The nucleic acid molecules of the invention such as those described above can 
be identified and isolated using standard molecular biology techniques and the 
. sequence information provided herein. For example, nucleic acid molecules can be 
amplified and isolated by the polymerase chain reaction using synthetic 

25 oligonucleotide primers designed based on one or more of the sequences selected 
from the group consisting of SEQ ID NOs: 1, 1 14-258 or the complement of such a 
sequence, or designed based on nucleotides based on sequences encoding one or more 
of the amino acid sequences provided herein. See generally PCR Technology: 
Principles and Applications for DNA Amplification (ed. H .A. Erlich, Freeman Press, 

30 NY, NY, 1 992) ; PCR Protocols: A Guide to Methods and Applications (Eds. Innis et 
al 9 Academic Press, San Diego, CA, 1990); Mattila et at, Nucl Acids Res. 19: 4967 
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(1991); Eckert et al, PCR Methods and Applications 1:17 (1991); PCR (eds. 
McPherson et al y IRL Press, Oxford); and U.S. Patent 4,683,202. The nucleic acid 
molecules can be amplified using cDNA, mRNA or genomic DNA as a template, 
cloned into an appropriate vector and characterized by DNA sequence analysis. 
5 Other suitable amplification methods include the ligase chain reaction (LCR) 

(see Wu and Wallace, Genomics 4:560 (1989), Landegren et al 9 Science 241: 1077 

(1988) , transcription amplification (Kwoh et al, Proc. Natl Acad. Sci. USA 86: 1 1 73 

(1989) ), and self-sustained sequence replication (Guatelli et al 9 Proc. Nat Acad. Set 
USA 87: 1874 (1990)) and nucleic acid based sequence amplification (NASBA). The 

10 latter two amplification methods involve isothermal reactions based on isothermal 
transcription, which produce both single stranded KNA (ssRNA) and double stranded 
DNA (dsDNA) as the amplification products in a ratio of about 30 or 1 00 to 1, 
respectively. 

The amplified DNA can be labeled, for example, radiolabeled, and used as a 

15 probe for screening a cDNA library derived from human cells, mRNA in zap express, 
ZIPLOX or other suitable vector. Corresponding clones can be isolated, DNA can 
obtained following in vivo excision, and the cloned insert can be sequenced in either 
or both orientations by art recognized methods to identify the correct reading frame 
encoding a polypeptide of the appropriate molecular weight. For example, the direct 

20 analysis of the nucleotide sequence of nucleic acid molecules of the present invention 
can be accomplished using well-known methods that are commercially available. 
See, for example, Sambrook etal, Molecular Cloning, A Laboratory Manual (2nd 
Ed., CSHP, New York 1989); Zyskind et al, Recombinant DNA Laboratory Manual, 
(Acad. Press, 1988)). Additionally, fluorescence methods are also available for 

25 analyzing nucleic acids (Chen et al, Genome Res. 9, 492 (1999)) and polypeptides. 
Using these or similar methods, the polypeptide and the DNA encoding the 
polypeptide can be isolated, sequenced and further characterized. 

Antisense nucleic acid molecules of the invention can be designed using the 
nucleotide sequences of one or more of SEQ ID NOs: 1,114-258 and/or the 

30 complement of one or more of SEQ ID NOs: 1, 1 14-258 and/or a portion of one or 
more of SEQ ID NOs: 1, 1 14-258 or the complement of one or more of SEQ ID NOs: 



.200404 1193A2J_> 



WO 2004/041193 



PCT/US2003/034681 



-34- 

1, 1 14-258 and constructed using chemical synthesis and enzymatic ligation reactions 
using procedures known in the art. For example, an antisense nucleic acid molecule 
(e.g., an antisense oligonucleotide) can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the 
5 biological stability of the molecules or to increase the physical stability of the duplex 
formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. Alternatively, the 
antisense nucleic acid molecule can be produced biologically using an expression 
vector into which a nucleic acid molecule has been subcloned in an antisense 

10 orientation {i.e., RNA transcribed from the inserted nucleic acid molecule will be of 
an antisense orientation to a target nucleic acid of interest). 

The nucleic acid sequences can also be used to compare with endogenous . 
DNA sequences in patients to identify one or more of the disorders described above, 
and as probes, such as to hybridize and discover related DNA sequences or to subtract 

15 out known sequences from a sample. The nucleic acid sequences can further be used 
to derive primers for genetic fingerprinting, to raise anti-polypeptide antibodies using 
DNA immunization techniques, and as an antigen to raise anti-DNA antibodies or 
elicit immune responses. Portions or fragments of the nucleotide sequences identified 
herein (and the corresponding complete gene sequences) can be used in numerous 

20 ways as polynucleotide reagents. For example, these sequences can be used to: (i) 
map their respective genes on a chromosome; and, thus, locate gene regions 
associated with genetic disease; (ii) identify an individual from a minute biological 
sample (tissue typing); and (iii) aid in forensic identification of a biological sample. 
Additionally, the nucleotide sequences of the invention can be used to identify and 

25 express recombinant polypeptides for analysis, characterization or therapeutic use, or 
as markers for tissues in which the corresponding polypeptide is expressed, either 
constitutively, during tissue differentiation, or in diseased states. The nucleic acid 
sequences can additionally be used as reagents in the screening and/or diagnostic 
assays described herein, and can also be included as components of kits (e.g., reagent 

30 kits) for use in the screening and/or diagnostic assays described herein. 
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Vectors and Host Cells 

Another aspect of the invention pertains to nucleic acid constructs containing a 
nucleic acid molecule selected from the group consisting of SEQ ID NOs: 1, 1 14-258 
and the complements thereof (or a portion thereof). The constructs comprise a vector 
5 {e.g., an expression vector) into which a sequence of the invention has been inserted 
in a sense or antisense orientation. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has 
been linked. One type of vector is a "plasmid" which refers to a circular double 
stranded DNA loop into which additional DNA segments can be ligated. Another 

10 type of vector is a viral vector, wherein additional DNA segments can be ligated into 
the viral genome. Certain vectors are capable of autonomous replication in a host cell 
into which they are introduced {e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors {e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 

15 into the host cell, and thereby are replicated along with the host genome. Expression 
vectors are capable of directing the expression of genes to which they are operably 
linked. In general, expression vectors of utility in recombinant DNA techniques are 
often in the form of plasmids. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors {e.g. , replication defective 

20 retroviruses, adenoviruses and adeno-iassociated viruses) that serve equivalent 
v functions. 

In certain embodiments, recombinant expression vectors of the invention 
comprise a nucleic acid molecule of the invention in a form suitable for expression of 
the nucleic acid molecule in a host cell. This means that the recombinant expression 

25 vectors include one or more regulatory sequences, selected on the basis of the host 
cells to be used for expression, which is operably linked to the nucleic acid sequence 
to be expressed. Within a recombinant expression vector, "operably linked" or 
"operatively linked" is intended to mean that the nucleotide sequence of interest is 
linked to the regulatory sequence(s) in a manner which allows for expression of the 

30 nucleotide sequence {e.g., in an in vitro transcription/translation system or in a host 
cell when the vector is introduced into the host cell). The term "regulatory sequence" 
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is intended to include promoters, enhancers and other expression control elements 
(e.g., polyadenylation signals). Such regulatory sequences are described, for example, 
in Goeddel, "Gene Expression Technology", Methods in Enzymology 1 85, Academic 
Press, San Diego, CA.(1990). Regulatory sequences include those which direct 
5 constitutive expression of a nucleotide sequence in many types of host cell and those 
which direct expression of the nucleotide sequence only in certain host cells (e.g. , 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of 
the host cell to be transformed and the level of expression of polypeptide desired. The 

10 expression vectors of the invention can be introduced into host cells to thereby 
produce polypeptides, including fusion polypeptides, encoded by nucleic acid 
molecules as described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g., 

15 bacterial cells such as E. coli, insect cells (using baculovirus expression vectors), 
yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
supra. Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

20 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms <c host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but also to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 

25 generations due to either mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic 
acid molecule of the invention can be expressed in bacterial cells (e.g., E. coli\ insect 

30 cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS 
cells). Other suitable host cells are known to those skilled in the art. 
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Vector DNA can be introduced into prokaryotic or eulcaryo tic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing a foreign nucleic acid molecule (e.g., DNA) info a host 
5 cell, including calcium phosphate or calcium chloride co-precipitation, DEAE- 

dextran-mediated transfection, lipofection, or electroporation. Suitable methods for 
transforming or transfecting host cells can be found in Sambrook, et ah, (supra), and 
other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 

10 the expression vector and transfection technique used, only a small fraction of cells 
may integrate the foreign DNA into their genome. In order to identify and select 
these integrants, a gene that encodes a selectable marker (e.g., for resistance to 
antibiotics) is generally introduced into the host cells along with the gene of interest. 
, Preferred selectable markers include those that confer resistance to drugs, such as 

15 G41 8, hygromycin and methotrexate. Nucleic acid molecules encoding a selectable 
marker can be introduced into a host cell on the same vector as the nucleic acid 
molecule of the invention or can be introduced on a separate vector. Cells stably 
transfected with the introduced nucleic acid molecule can be identified by drug 
selection (e.g., cells that have incorporated the selectable marker gene will survive, 

20 while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) a polypeptide of the invention. 
Accordingly, the invention further provides methods for producing a polypeptide 
using the host cells of the invention. In one embodiment, the method comprises 

25 culturing the host cell of invention (into which a recombinant expression vector 
encoding a polypeptide of the invention has been introduced) in a suitable medium 
such that the polypeptide is produced. In another embodiment, the method further 
comprises isolating the polypeptide from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 

30 transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which a nucleic acid molecule of the 
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invention has been introduced (e.g., an exogenous KChlPl gene, or an exogenous 
nucleic acid encoding a KChlPl polypeptide). Such host cells can then be used to 
create non-human transgenic animals in which exogenous nucleotide sequences have 
been introduced into the genome or homologous recombinant animals in which 
endogenous nucleotide sequences have been altered. Such animals are useful for 
studying the function and/or activity of the nucleotide sequence and polypeptide 
encoded by the sequence and for identifying and/or evaluating modulators of their 
activity. As used herein, a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, in which one or more of 
the cells of the animal include a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, chickens and amphibians. A 
transgene is exogenous DNA which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the genome of the mature animal, 
thereby directing the expression of an encoded gene product in one or more cell types 
or tissues of the transgenic animal. As used herein, an "homologous recombinant 
animal" is a non-human animal, preferably a mammal, more preferably a mouse, in 
which an endogenous gene has been altered by homologous recombination between 
the endogenous gene and an exogenous DNA molecule introduced into a cell of the 
animal, e.g., an embryonic cell of the animal, prior to development of the animal. 

Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Pat. NO: 4,873,191 and in Hogan, Manipulating the Mouse Embryo (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Methods for constructing 
homologous recombination vectors and homologous recombinant animals are 
described further in Bradley, Current Opinion in BioTechnology 2:823-829 (1991) 
and in PCT Publication Nos. WO 90/1 1354, WO 91/01 140, WO 92/0968, and.WO 
93/041 69. Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut et aL, Nature 385:810-813 
(1997) and PCT Publication Nos. WO 97/07668 and WO 97/07669. 
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POLYPEPTIDES OF THE INVENTION 

The present invention also pertains to isolated polypeptides encoded by 
KChlPl nucleic acids ("KChlPl polypeptides ," or "KChlPl proteins," such as the 
protein shown in SEQ ID NO: 2) and fragments and variants thereof, as well as 
5 polypeptides encoded by nucleotide sequences described herein {e.g., other splicing 
variants). The term "polypeptide" refers to a polymer of amino acids, and not to a 
specific length; thus, peptides, oligopeptides and proteins are included within the 
definition of a polypeptide. As used herein, a polypeptide is said to be "isolated" or 
"purified" when it is substantially free of cellular material when it is isolated from 

10 recombinant and non-recombinant cells, or free of chemical precursors or other 

chemicals when it is chemically synthesized. A polypeptide, however, can be joined 
to another polypeptide with which it is not normally associated in a cell (e.g., in a 
"fusion protein") and still be "isolated" or "purified." 

The polypeptides of the invention can be purified to homogeneity. It is 

15 understood, however, that preparations in which the polypeptide is not purified to 
homogeneity are useful. The critical feature is that the preparation allows for the 
desired function of the polypeptide, even in the presence of considerable amounts of 
other components. Thus, the invention encompasses various degrees of purity. In one 
embodiment, the language "substantially free of cellular material" includes 

20 preparations of the polypeptide having less than about 30% (by dry weight) other 
proteins (i.e., contaminating protein), less than about 20% other proteins, less than 
about 10% other proteins, or less than about 5% other proteins. 

When a polypeptide is recombinantly produced, it can also be substantially 
free of culture medium, i.e., culture medium represents less than about 20%, less than 

25 about 10%, or less than about 5% of the volume of the polypeptide preparation. The 
language "substantially free of chemical precursors or other chemicals" includes 
preparations of the polypeptide in which it is separated from chemical precursors or 
other chemicals that are involved in its synthesis. In one embodiment, the language 
"substantially free of chemical precursors or other chemicals" includes preparations of 

30 the polypeptide having less than about 30% (by dry weight) chemical precursors or 
other chemicals, less than about 20% chemical precursors or other chemicals, less 
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than about 10% chemical precursors or other chemicals, or less than about 5% 
chemical precursors or other chemicals. 

In one embodiment, a polypeptide of the invention comprises an amino acid 
sequence encoded by a nucleic acid molecule comprising a nucleotide sequence of 
5 SEQ ID NO: 1 , optionally additionally comprising one or more of SEQ ID NOs: 114- 
258; or the complement of such a nucleic acid, or portions thereof, or a portion or 
polymorphic variant thereof. However, the polypeptides of the invention also 
encompass fragment and sequence variants. Variants include a substantially 
homologous polypeptide encoded by the same genetic locus in an organism, Le. 9 an 
10 allelic variant, as well as other splicing variants. Variants also encompass 

polypeptides derived from other genetic loci in an organism, but having substantial 
homology to a polypeptide encoded by a nucleic acid molecule comprising a 
nucleotide of SEQ ID NO: 1, optionally additionally one or more of SEQ ID NOs: 
1 14-258; or a complement of such a sequence, or portions thereof or polymorphic 
15 variants thereof. Variants also include polypeptides substantially homologous or 
identical to these polypeptides but derived from another organism, i.e. 9 an ortholog. 
Variants also include polypeptides that are substantially homologous or identical to 
these polypeptides that are produced by chemical synthesis. Variants also include 
polypeptides that are substantially homologous or identical to these polypeptides that 
20 are produced by recombinant methods. 

As used herein, two polypeptides (or a region of the polypeptides) are 
substantially homologous or identical when the amino acid sequences are at least 
about 45-55%, in certain embodiments at least about 70-75%, and in other 
embodiments at least about 80-85%, and in other embodiments greater than about 
25 90% or more homologous or identical. A substantially homologous amino acid 
sequence, according to the present invention, will be encoded by a nucleic acid 
molecule hybridizing to of SEQ ID NO: 1 or any one of 1 14-258 or portion thereof, 
under stringent conditions as more particularly described above, or will be encoded by 
a nucleic acid molecule hybridizing to a nucleic acid sequence encoding SEQ ID NO: 
30 1 or any one of 1 14-258 or a portion thereof or polymorphic variant thereof, under 
stringent conditions as more particularly described thereof. 
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The invention also encompasses polypeptides having a lower degree of 
identity but having sufficient similarity so as to perform one or more of the same 
functions performed by a polypeptide encoded by a nucleic acid molecule of the 
invention. 

5 Similarity is determined by conserved amino acid substitution where a given 

amino acid in a polypeptide is substituted by another amino acid of like 
characteristics. Conservative substitutions are likely to be phenotypically silent. 
Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and He; interchange of the hydroxyl 

10 residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution 

between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg 
and replacements among the aromatic residues Phe and Tyr. Guidance concerning 
which amino acid changes are likely to be phenotypically silent are found in Bowie et 
al. 9 Science 247:1306-13 10 (1990). 

15 A variant polypeptide can differ in amino acid sequence by one or more 

substitutions, deletions, insertions, inversions, fusions, and truncations or a 
combination of any of these. Further, variant polypeptides can be fully functional or 
can lack function in one or more activities. Fully functional variants typically contain 
only conservative variation or variation in non-critical residues or in non-critical 

20 regions. Functional variants can also contain substitution of similar amino acids that 
result in no change or an insignificant change in function. Alternatively, such 
substitutions may positively or negatively affect function to some degree. Non- 
functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, 

25 insertion, inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods 
known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
(Cunningham et aL, Science 244:1082-1185 (1989)). The latter procedure introduces 
single alanine mutations at every residue in the molecule. The resulting mutant 

30 molecules are then tested for biological activity in vitro, or in vitro proliferative 
activity. Sites that are critical for polypeptide activity can also be determined by 
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structural analysis such as crystallization, nuclear magnetic resonance or photoafSnity 
labeling (Smith et al,J. Mol Biol. 224:899-904 (1992); de Vos etaL, Science 
255:306-312 (1992)). 

The invention also includes polypeptide fragments of the polypeptides of the 
5 invention. Fragments can be derived from a polypeptide encoded by a nucleic acid 
molecule comprising SEQ ID NO: 1 and optionally comprising one or more of SEQ 
ID NOs: 1 14-258; or a complement of such a nucleic acid or other variants. 
However, the invention also encompasses fragments of the variants of the 
polypeptides described herein. As used herein, a fragment comprises at least 6 

10 contiguous amino acids. Useful fragments include those that retain one or more of the ' 
biological activities of the polypeptide as well as fragments that can be used as an 
immunogen to generate polypeptide-specific antibodies. 

Biologically active fragments (peptides which are, for example, 6, 9, 12, 15, 
16, 20, 30, 35, 36, 37, 38, 39, 40, 50, 100 or more amino acids in length) can comprise 

15 a domain, segment, or motif that has been identified by analysis of the polypeptide 
sequence using well-known methods, e.g., signal peptides, extracellular domains, one 
or more transmembrane segments or loops, ligand binding regions, zinc finger 
domains, DNA binding domains, acylation sites, glycosylation sites, or 
phosphorylation sites. 

20 Fragments can be discrete (not fused to other amino acids or polypeptides) or 

can be within a larger polypeptide. Further, several fragments can be comprised 
within a single larger polypeptide. In one embodiment a fragment designed for 
expression in a host can have heterologous pre- and pro-polypeptide regions fused to 
the amino terminus of the polypeptide fragment and an additional region fused to the 

25 carboxyl terminus of the fragment. 

The invention thus provides chimeric or fusion polypeptides. These comprise 
a polypeptide of the invention operatively linked to a heterologous protein or 
polypeptide having an amino acid sequence not substantially homologous to the 
polypeptide. 

30 "Operatively linked" indicates that the polypeptide and the heterologous 

protein are fused in-frame. The heterologous protein can be fused to the N-terminus 
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or C-terminus of the polypeptide. In one embodiment the fusion polypeptide does not 
affect function of the polypeptide per se. For example, the fusion polypeptide can be 
a GST-fusion polypeptide in which the polypeptide sequences are fused to the C- 
terminus of the GST sequences. Other types of fusion polypeptides include, but are 
5 not limited to, enzymatic fusion polypeptides, for example beta-galactosidase fusions, 
yeast two-hybrid GAL fusions, poly-His fusions and Ig fusions. Such fusion 
polypeptides, particularly poly-His fusions, can facilitate the purification of 
recombinant polypeptide. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of a polypeptide can be increased using a heterologous 

10 signal sequence. Therefore, in another embodiment, the fusion polypeptide contains a 
heterologous signal sequence at its N- terminus. 

EP-A-O 464 533 discloses fusion proteins comprising various portions of 
immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and thus 
results, for example, in improved pharmacokinetic properties (EP-A 0232 262). In 

15 drug discovery, for example, human proteins have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists. Bennett et ai, 
Journal of Molecular Recognition, 5:52-58 (1995) and Johanson et al. 9 Tlie Journal of 
Biological Chemistry, 270,16:9459-9471 (1995). Thus, this invention also 
encompasses soluble fusion polypeptides containing a polypeptide of the invention 

20 and various portions of the constant regions of heavy or light chains of 
immunoglobulins of various subclasses (IgG, IgM, IgA, IgE). 

A chimeric or fusion polypeptide can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional techniques. 

25 In another embodiment, the fusion gene can be synthesized by conventional 

techniques including automated DNA synthesizers. Alternatively^ PCR amplification 
of nucleic acid fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive nucleic acid fragments which can 
subsequently be annealed and re-amplified to generate a chimeric nucleic acid 

30 sequence (see Ausubel et aL 9 Current Protocols in Molecular Biology, 1992). 
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Moreover, many expression vectors are commercially available that already 
encode a fusion moiety (e.g., a GST protein). A nucleic acid molecule encoding a 
polypeptide of the invention can be cloned into such an expression vector such that 
the fusion moiety is linked in-frame to the polypeptide. 
5 The isolated polypeptide can be purified from cells that naturally express it, 

can be purified from cells that have been altered to express it (recombinant), or 
synthesized using known protein synthesis methods. In one embodiment, the 
polypeptide is produced by recombinant DNA techniques. For example, a nucleic 
acid molecule encoding the polypeptide is cloned into an expression vector, the 

10 expression vector introduced into a host cell and the polypeptide expressed in the host 
cell. The polypeptide can then be isolated from the cells by an appropriate 
purification scheme using standard protein purification techniques. 

The polypeptides of the present invention can be used to raise antibodies or to 
elicit an immune response. The polypeptides can also be used as a reagent, e.g., a 

15 labeled reagent, in assays to quantitatively determine levels of the polypeptide or a 
molecule to which it binds (e.g., a ligand) in biological fluids. The polypeptides can 
also be used as markers for cells or tissues in which the corresponding polypeptide is 
preferentially expressed, either constitutively, during tissue differentiation, or in a 
diseased state. The polypeptides can be used to isolate a corresponding binding agent, 

20 e.g., ligand or receptor, such as, for example, in an interaction trap assay, and to 

screen for peptide or small molecule antagonists or agonists of the binding interaction. 

ANTIBODIES OF THE INVENTION 

Polyclonal antibodies and/or monoclonal antibodies that specifically bind one 

25 form of the gene product but not to the other form of the gene product are also 

provided. Antibodies are also provided which bind a portion of either the variant or 
the reference gene product that contains the polymorphic site or sites. The term 
"antibody" as used herein refers to immunoglobulin molecules and immunologically 
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen 

30 binding site that specifically bind an antigen. A molecule that specifically binds to a 
polypeptide of the invention is a molecule that binds to that polypeptide or a fragment 
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thereof, but does not substantially bind other molecules in a sample, e.g., a biological 
sample, which naturally contains the polypeptide. Examples of immunologically 
active portions of immunoglobulin molecules include F(ab) and F(ab'>2 fragments 
which can be generated by treating the antibody with an enzyme such as pepsin. The 

5 invention provides polyclonal and monoclonal antibodies that bind to a polypeptide of 
the invention. The term "monoclonal antibody* or "monoclonal antibody 
composition", as used herein, refers to a population of antibody molecules that 
contain only one species of an antigen binding site capable of immunoreacting with a 
particular epitope of a polypeptide of the invention. A monoclonal antibody 

10 composition thus typically displays a single binding affinity for a particular 
polypeptide of the invention with which it immunoreacts. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a desired immunogen, e.g., polypeptide of the invention or a 
fragment thereof. The antibody titer in the immunized subject can be monitored over 

15 time by standard techniques, such as with an enzyme linked immunosorbent assay . 
(ELIS A) using immobilized polypeptide. If desired, the antibody molecules directed 
against the polypeptide can be isolated from the mammal (e.g., from the blood) and 
further purified by well-known techniques, such as protein A chromatography to 
obtain the IgG fraction. At an appropriate time after immunization, eg-., when the 

20 antibody titers are highest; antibody-producing cells can be obtained from the subject 
and used to prepare monoclonal antibodies by standard techniques, such as the 
hybridoma technique originally described by Kohler and Milstein, Nature 256:495- 
497 (1975), the human B cell hybridoma technique (Kozbor et al, Immunol. Today 4: 
72 (1983)), the EBV-hybridoma technique (Cole et al., Monoclonal Antibodies and 

25 Cancer Therapy, Alan R. Liss,1985, Inc., pp. 77-96) or trioma techniques. The 

technology for producing hybridomas is well known (see generally Current Protocols 
in Immunology (1994) Coligan et al, (eds.) John Wiley & Sons, Inc., New York, 
NY), Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with an immunogen as described 

30 above, and the culture supernatants of the resulting hybridoma cells are screened to 
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identify a hybridoma producing a monoclonal antibody that binds a polypeptide of the 
invention. 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating a monoclonal 
5 antibody to a polypeptide of the invention (see, e.g. , Current Protocols in 
Immunology, supra; Galfre et al, Nature 266:55052 (1977); R.H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); and Lemer, Yale J. Biol Med. 54:387-402 
(1981)). Moreover, the ordinarily skilled worker will appreciate that there are many 

10 variations of such methods that also would be useful. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal antibody to a polypeptide of the invention can be identified and isolated 
by screening a recombinant combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with the polypeptide to thereby isolate immunoglobulin library 

15 members that bind the polypeptide. Kits for generating and screening phage display 
libraries are commercially available (e.g., the Pharmacia Recombinant Phage y " 
Antibody System, Catalog NO: 27-9400-01; and the Stratagene SwfeAP™ Phage? 
Display Kit, Catalog NO: 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening antibody display library can 

20 be found in, for example, U.S. Patent NO: 5,223,409; PCT Publication NO: WO 

92/18619; PCT Publication NO: WO 91/17271; PCT Publication NO: WO 92/20791; 
PCT Publication NO: WO 92/15679; PCT Publication NO: WO 93/01288; PCT 
Publication NO: WO 92/01047; PCT Publication NO: WO 92/09690; PCT 
Publication NO: WO 90/02809; Fuchs et al, Bio/Technology 9: 1370-1372 (1991); 

25 Hay et al, Hum. Antibod. Hybridomas 3:81-85 (1992); Huse et al, Science 246: 
1275-1281 (1989); and Griffiths et al, EMBO J. 12:725-734 (1993). 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can 
be made using standard recombinant DNA techniques, are within the scope of the 

30 invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art. 
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Ih general, antibodies of the invention (e.g., a monoclonal antibody) can be 
used to isolate a polypeptide of the invention by standard techniques, such as affinity 
chromatography or immunoprecipitation. A polypeptide-specific antibody can 
facilitate the purification of natural polypeptide from cells and of recombinantly 
5 produced polypeptide expressed in host cells. Moreover, an antibody specific for a 
polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular 
lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and 
pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for 

10 example, determine the efficacy of a given treatment regimen. The antibody can be 
coupled to a detectable substance to facilitate its detection. Examples of detectable 
substances include various enzymes, prosthetic groups, fluorescent materials, 
luminescent materials, bioluminescent materials, and radioactive materials. Examples 
of suitable enzymes include horseradish peroxidase, alkaline phosphatase, beta- 

15 galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dicUorotriazinylamine fluorescein, dansyl chloride or phycoeiythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent 

20 materials include luciferase, luciferin, and aequorin, and examples of suitable 
radioactive material include 125 I > l31 I, 35 S or 3 H. 

DIAGNOSTIC ASSAYS 

The nucleic acids, probes, primers, polypeptides and antibodies described 

25 herein can be used in methods of diagnosis of Type II diabetes; of a susceptibility to 
Type II diabetes; or of a condition associated with, a KChlPl gene, as well as in kits 
(e.g., useful for diagnosis of Type II diabetes; a susceptibility to Type II diabetes; or a 
condition associated with a KChlPl gene). In one embodiment, the kit comprises 
primers which contain one or more of the SNP 's identified in Table 10. 

30 In one embodiment of the invention, diagnosis of a disease or condition 

associated with a KChlPl gene (e.g., diagnosis of Type II diabetes, or of a 
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susceptibility to Type II diabetes) is made by detecting a polymorphism in a KChlPl 
nucleic acid as described herein. The polymorphism can be a change in a KChlPl 
nucleic acid, such as the insertion or deletion of a single nucleotide, or of more than 
one nucleotide, resulting in a frame shift; the change of at least one nucleotide, 
5 resulting in a change in the encoded amino acid; the change of at least one nucleotide, 
resulting in the generation of a premature stop codon; the deletion of several 
nucleotides, resulting in a deletion of one or more amino acids encoded by the 
nucleotides; the insertion of one or several nucleotides, such as by unequal 
recombination or gene conversion, resulting in an interruption of the coding sequence 

10 of the gene; duplication of all or a part of the gene; transposition of all or a part of the 
gene; or rearrangement of all or a part of the gene. More than one such change may 
be present in a single gene. Such sequence changes cause a difference in the 
polypeptide encoded by a KChlPl nucleic acid. For example, if the difference is a 
frame shift change, the frame shift can result in a change in the encoded amino acids, 

15 and/or can result in the generation of a premature stop codon, causing generation of a 
truncated polypeptide. Alternatively, a polymorphism aissociated with a disease or 
condition of a susceptibility to a disease or condition associated with a KChlPl 
nucleic acid can be a synonymous alteration in one or more nucleotides (i.e., an 
alteration that does not result in a change in the polypeptide encoded by a KChlPl 

20 nucleic acid). Such a polymorphism may alter splicing sites, affect the stability or 
transport of mKNA, or otherwise affect the transcription or translation of the gene. A 
KChlPl nucleic acid that has any of the changes or alterations described above is 
referred to herein as an "altered nucleic acid.'* 

In a first method of diagnosing Type II diabetes or a susceptibility to Type H 

25 diabetes, or another disease or condition associated with a KChlPl gene, 

hybridization methods, such as Southern analysis, Northern analysis, or in situ 
hybridizations, can be used (see Current Protocols in Molecular Biology, Ausubel, F. 
et al., eds, John Wiley & Sons, including all supplements through 1999). For 
example, a biological sample (a "test sample") from a test subject (the "test 

30 individual") of genomic DNA, RNA, or cDNA, is obtained from an individual, such 
as an individual suspected of having, being susceptible to or predisposed for, or 
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carrying a defect for, the disease or condition, or the susceptibility to the disease or 
condition, associated with a KChlPl gene (e.g., Type II diabetes). The individual can 
be an adult, child, or fetus. The test sample can be from any source which contains 
genomic DNA, such as a blood sample, sample of amniotic fluid, sample of 
5 cerebrospinal fluid, or tissue sample from skin, muscle, buccal or conjunctival 

mucosa, placenta, gastrointestinal tract or other organs. A test sample of DNA from 
fetal cells or tissue can be obtained by appropriate methods, such as by amniocentesis 
or chorionic villus sampling. The DNA, RNA, or cDNA sample is then examined to 
determine whether a polymorphism in a KChlPl nucleic acid is present, and/or to 

10 determine which splicing variant(s) encoded by the KChlPl is present. The presence 
of the polymorphism or splicing variant(s) can be indicated by hybridization of the 
gene in the genomic DNA, RNA, or cDNA to a nucleic acid probe. A "nucleic acid 
probe", as used herein, can be a DNA probe or an RNA probe; the nucleic acid probe 
can contain, for example, at least one polymorphism in a KChlPl nucleic acid (e.g., 

15 as set forth in Table 10) and/or contain a nucleic acid encoding a particular splicing 
variant of a KChlPl nucleic acid. The probe can be any of the nucleic acid molecules 
described above (e.g., the gene or nucleic acid, a fragment, a vector comprising the 
gene or nucleic acid, a probe or primer, etc.). 

To diagnose Type II diabetes, or a susceptibility to Type II diabetes, or 

20 another condition associated with a KChlPl gene, a hybridization sample is formed 
by contacting the test sample containing a KChlPl nucleic acid with at least one 
nucleic acid probe. A preferred probe for detecting mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to mRNA or genomic DNA 
sequences described herein. The nucleic acid probe can be, for example, a full-length 

25 nucleic acid molecule, or a portion thereof, such as an oligonucleotide of at least 15, 
30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize 
under stringent conditions to appropriate mRNA or genomic DNA. For example, the 
nucleic acid probe can be all or a portion of one of SEQ ID NOs: 1 14-258 or the 
complement thereof, or a portion thereof. Other suitable probes for use in the 

30 diagnostic assays of the invention are described above (see e.g., probes and primers 
discussed under the heading, te Nucleic Acids of the Invention"). 
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The hybridization sample is maintained under conditions that are sufficient to 
allow specific hybridization of the nucleic acid probe to a KChlPl nucleic acid. 
"Specific hybridization", as used herein, indicates exact hybridization (e.g., with no 
mismatches). Specific hybridization can be performed under high stringency 
5 conditions or moderate stringency conditions, for example, as described above. In a 
particularly preferred embodiment, the hybridization conditions for specific 
hybridization are high stringency. 

Specific hybridization, if present, is then detected using standard methods. If 
specific hybridization occurs between the nucleic acid probe and KChlPl nucleic acid 

10 in the test sample, then the KChlPl has the polymorphism, or is the splicing variant, 
that is present in the nucleic acid probe. More than one nucleic acid probe can also be 
used concurrently in this method. Specific hybridization of any one of the nucleic 
acid probes is indicative of a polymorphism in the KChlPl nucleic acid, or of the 
presence of a particular splicing variant encoding the KChlPl nucleic acid and is 

15 therefore diagnostic for a susceptibility to a disease or condition associated with a 
KChlPl nucleic acid (e.g., Type II diabetes). 

In Northern analysis (see Current Protocols in Molecular Biology, Ausubel, R 
et aL, eds., John Wiley & Sons, supra) the hybridization methods described above are 
used to identify the presence of a polymorphism or a particular splicing variant, 

20 associated with a susceptibility to a disease or condition associated with a KChlPl 
gene (e.g., Type II diabetes). For Northern analysis, a test sample of RNA is obtained 
from the individual by appropriate means. Specific hybridization of a nucleic acid 
probe, as described above, to RNA from the individual is indicative of a 
polymorphism in a KChlPl nucleic acid, or of the presence of a particular splicing 

25 variant encoded by a KChlPl nucleic acid and is therefore diagnostic for Type II 
diabetes or a susceptibility to Type II diabetes or a condition associated with a 
KChlPl nucleic acid (e.g., Type II diabetes). 

For representative examples of use of nucleic acid probes, see, for example, 
U.S. Patents NO: 5,288,611 and 4,851,330. 

30 Alternatively, a.peptide nucleic acid (PNA) probe can be used instead of a 

nucleic acid probe in the hybridization methods described above. PNA is a DNA 
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mimic having a peptide-like, inorganic backbone, such as N-(2-aminoethyl)glycine 
units, with an organic base (A* G, C, T or U) attached to the glycine nitrogen via a 
methylene carbonyl linker (see, for example, Nielsen, P.E. et al. 9 Bioconjugaie 
Chemistry 5, American Chemical Society, p. 1 (1994). The PNA probe can be 
5 designed to specifically hybridize to a gene having a polymorphism associated with a 
susceptibility to a disease or condition associated with a KChlPl nucleic acid (e.g., 
Type n diabetes). Hybridization of the PNA probe to a KChlPl gene is diagnostic for 
Type II diabetes or a susceptibility to Type II diabetes or a condition associated with a 
KChlPl nucleic acid. 

10 In another method of the invention, alteration analysis by restriction digestion 

can be used to detect an altered gene, or genes containing a polymorphism^), if the 
alteration (mutation) or polymorphism in the gene results in the creation or 
elimination of a restriction site. A test sample containing genomic DNA is obtained 
from the individual. Polymerase chain reaction (PGR) can be used to amplify a 

15 KChlPl nucleic acid (and, if necessary, the flanking sequences) in the test sample of 
genomic DNA from the test individual. RFLP analysis is conducted as described (see 
Current Protocols in Molecular Biology?, supra). The digestion pattern of the relevant 
DNA fragment indicates the presence or absence of the alteration or polymorphism in 
the KChlPl nucleic acid, and therefore indicates the presence or absence of Type H 

20 diabetes or the susceptibility to a disease or condition associated with a KChIP 1 
nucleic acid. 

Sequence analysis can also be used to detect specific polymorphisms in a 
KChIP 1 nucleic acid. A test sample of DNA or RNA is obtained from the test 
individual. PCR or other appropriate methods can be used to amplify the gene or 

25 nucleic acid, and/or its flanking sequences, if desired. The sequence of a KChlPl 

nucleic acid, or a fragment of the nucleic acid, or cDNA, or fragment of the cDNA, or 
mRNA, or fragment of the mRNA, is determined, using standard methods. The 
sequence of the nucleic acid, nucleic acid fragment, cDNA, cDNA fragment, mRNA, 
or mRNA fragment is compared with the known nucleic acid sequence of the gene or 

30 cDNA {e.g., one or more of SEQ ID NOs:, 1 14-258 or a complement thereof) or 
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mRNA, as appropriate. The presence of a polymorphism in the KChlPl indicates that 
the individual has Type II diabetes or a susceptibility to Type II diabetes. 

Allele-specific oligonucleotides can also be used to detect the presence of a 
polymorphism in a KChlPl nucleic acid, through the use of dot-blot hybridization of 

5 amplified oligonucleotides with allele-specific oligonucleotide (ASO) probes (see, for 
example, Saiki, R. et al. 9 Nature 324:163-166 (1986)). An "allele-specific 
oligonucleotide" (also referred to herein as an "allele-specific oligonucleotide probe") 
is an oligonucleotide of approximately 1 0-50 base pairs, preferably approximately 1 5- 
30 base pairs, that specifically hybridizes to a KChlPl nucleic acid, and that contains 

10 a polymorphism associated with a susceptibility to a disease or condition associated 
with a KChlPl nucleic acid. An allele-specific oligonucleotide probe that is specific 
for particular polymorphisms in a KChlPl nucleic acid can be prepared, using 
standard methods (see Current Protocols in Molecular Biology* , supra). To identify 
polymorphisms in the gene that are associated with a disease or condition associated 

15 with a KChlPl nucleic acid or a susceptibility to a disease or condition associated 
with a KChlPl nucleic acid a test sample of DNA is obtained from the individual. 
PCR can be used to amplify all or a fragment of a KChlPl nucleic acid and its 
flanking sequences. The DNA containing the amplified KChlPl nucleic acid (or 
fragment of the gene or nucleic acid) is dot-blotted, using standard methods (see 

20 Cwrent Protocols in Molecular Biology, supra), and the blot is contacted with the 
oligonucleotide probe. The presence of specific hybridization of the probe to the 
amplified KChlPl nucleic acid is then detected. Hybridization of an allele-specific 
oligonucleotide probe to DNA from the individual is indicative of a polymorphism in 
the KChlPl nucleic acid, and is therefore indicative of a disease or condition 

25 associated with a KChlPl nucleic acid or susceptibility to a disease or condition 
associated with a KChIP 1 nucleic acid (e.g., Type II diabetes). 

The invention further provides allele-specific oligonucleotides that hybridize 
to the reference or variant allele of a gene or nucleic acid comprising a single 
nucleotide polymorphism or to the complement thereof. These oligonucleotides can 

30 be probes or primers. 
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An allele-specific primer hybridizes to a site on target DNA overlapping a 
polymorphism and only primes amplification of an allelic form to which the primer 
exhibits perfect complementarity. See Gibbs, Nucleic Acid Res. 17, 2427-2448 
(1989). This primer is used in conjunction with a second primer, which hybridizes at 
5 a distal site. Amplification proceeds from the two primers, resulting in a detectable 
product, which indicates the particular allelic form is present. A control is usually 
performed with a second pair of primers, one of which shows a single base mismatch 
at the polymorphic site and the other of which exhibits perfect complementarity to a 
1 distal site. The single-base mismatch prevents amplification and no detectable 

10 product is formed. The method works best when the mismatch is included in the 3- 
most position of the oligonucleotide aligned with the polymorphism because this 
position is most destabilizing to elongation from the primer (see, e.g., WO 93/22456). 

With the addition of such analogs as locked nucleic acids (LNAs), the size of 
primers and probes can be reduced to as few as 8 bases. LNAs are a novel class of 

15 bicyclic DNA analogs in which the 2' and 4' positions in the furanose ring are joined 
via an O-methylene (oxy-LNA), S-methylene (thio-LNA), or amino methylene 
(amino-LNA) moiety. Common to all of these LNA variants is an affinity toward 
complementary nucleic acids, which is by far the highest reported for a DNA analog. 
For example, particular all oxy-LNA nonamers have been shown to have melting 

20 temperatures of 64 °C and 74 °C when in complex with complementary DNA or RNA, 
respectively, as oposed to 28 °C for both DNA and RNA for the corresponding DNA 
nonamer. Substantial increases in T m are also obtained when LNA monomers are 
used in combination with standard DNA or RNA monomers. For primers and probes, 
depending on where the LNA monomers are included (e.g., the 3 ! end, the 5 ! end, or in 

25 the middle), the T m could be increased considerably. 

. . In another embodiment, arrays of oligonucleotide probes that are 
complementary to target nucleic acid sequence segments from an individual, can be 
used to identify polymorphisms in a KChlPl nucleic acid. For example, in one 
embodiment, an oligonucleotide array can be used. Oligonucleotide arrays typically 

30 comprise a plurality of different oligonucleotide probes that are coupled to a surface 
of a substrate in different known locations. These oligonucleotide arrays, also 
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described as "Genechips™," have been generally described in the art, for example, 
U.S. Pat NO: 5,143,854 and PCT patent publication Nos. WO 90/15070 and 
92/10092. These arrays can generally be produced using mechanical synthesis 
methods or light directed synthesis methods that incorporate a combination of 
5 photolithographic methods and solid phase oligonucleotide synthesis methods. See 
Fodor et al. 9 Science 251:767-777 (1991), Pirrung et al. 9 U.S. Pat. NO: 5,143,854 (see 
also PCT Application NO: WO 90/15070) arid Fodor et aL, PCT Publication NO: WO 
92/10092 and U.S. Pat. NO: 5,424,186, the entire teachings of each of which are 
incorporated by reference herein. Techniques for the synthesis of these arrays using 

10 mechanical synthesis methods are described in, e.g., U.S. Pat. NO: 5,384,261; the 
entire teachings of which are incorporated by reference herein. In another example, 
linear arrays can be utilized. 

Once an oligonucleotide array is prepared, a nucleic acid of interest is 
hybridized with the array and scanned for polymorphisms. Hybridization and 

15 scanning are generally carried out by methods described herein and also in, e.g., 

published PCT Application Nos. WO 92/10092 and WO 95/1 1995, and U.S. Pat. NO: 
5,424,186, the entire teachings of which are incorporated by reference herein. In 
brief, a target nucleic acid sequence that includes one or more previously identified 
polymorphic markers is amplified by well-known amplification techniques, e.g. , PCR. 

20 Typically, this involves the use of primer sequences that are complementary to the 
two strands of the target sequence both upstream and downstream from the 
polymorphism. Asymmetric PCR techniques may also be used. Amplified target, 
generally incorporating a label, is then hybridized with the array under appropriate 
conditions. Upon completion of hybridization and washing of the array, the array is 

25 scanned to determine the position on the array to which the target sequence 

hybridizes. The hybridization data obtained from the scan is typically in the form of 
fluorescence intensities as a function of location on the array. 

Although primarily described in terms of a single detection block, e.g., for 
detection of a single polymorphism, arrays can include multiple detection blocks, and 

30 thus be capable of analyzing multiple, specific polymorphisms. In alternative 

arrangements, it will generally be understood that detection blocks may be grouped 
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within a single array or in multiple, separate arrays so that varying, optimal conditions 
may be used during the hybridization of the target to the array. For example, it may 
often be desirable to provide for the detection of those polymorphisms that fall within 
G-C rich stretches of a genomic sequence, separately from those falling in A-T rich 
5 segments. This allows for the separate optimization of hybridization conditions for 
each situation. 

Additional uses of oligonucleotide arrays for polymorphism detection can be 
found, for example, in U.S. Patents Nos. 5,858,659 and 5,837,832, the entire 
teachings of which are incorporated by reference herein. Other methods of nucleic 

10 acid analysis can be used to detect polymorphisms in a Type II diabetes gene or 

variants encoding by a Type II diabetes gene. Representative methods include direct 
manual sequencing (Church and Gilbert, Proc. Natl Acad. Sci. USA 81 :1991-1995 
(1988); Sanger, F. et al 9 Proc. Natl Acad. Set USA 74:5463-5467 (1977); Beavis et 
al t U.S. Pat. NO: 5,288,644); automated fluorescent sequencing; single-stranded 

15 conformation polymorphism assays (SSCP); clamped denaturing gel electrophoresis 
(CDGE); denaturing gradient gel electrophoresis (DGGE) (Sheffield, V.C. et ah 9 
Proc. Natl Acad. Sci. USA 86:232-236 (1989)), mobility shift analysis (Orita, M. et 
al,Proc. Natl Acad. Sci. USA 86:2766-2770 (1989)), restriction enzyme analysis 
(Flavell et al 9 Cell 15:25 (1978); Geever, et al, Proc. Natl Acad. Sci, USA 78:5081 

20 (1981)); heteroduplex analysis; chemical mismatch cleavage (CMC) (Cotton et al, 
Proc. Natl Acad. Sci. USA 85:4397-4401 (1985)); RNase protection assays (Myers, 
R.M. et al, Science 230: 1242 (1985)); use of polypeptides which recognize 
nucleotide mismatches, such as E. coli mutS protein; allele-specific PCR, for 
example. 

25 hi one embodiment of the invention, diagnosis of a disease or condition 

associated with a KChlPl nucleic acid (e.g., Type II diabetes) or a susceptibility to a 
disease or condition associated with a KChlPl nucleic acid (e.g., Type II diabetes) 
can also be made by expression analysis by quantitative PCR (kinetic thermal 
cycling). This technique, utilizing TaqMan®, can be used to allow the identification of 

30 polymorphisms and whether a patient is homozygous or heterozygous. The technique 
can assess the presence of an alteration in the expression or composition of the 
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polypeptide encoded by a KChlPl nucleic acid or splicing variants encoded by a 
KChlPl nucleic acid. Further, the expression of the variants can be quantified as 
physically or functionally different 

In another embodiment of the invention, diagnosis of Type II diabetes or a 
5 susceptibility to Type II diabetes 9or a condition associated with a KChlPl gene) can 
be made by examining expression and/or composition of a KChlPl polypeptide, by a 
variety of methods, including enzyme linked immunosorbent assays (ELISAs), 
Western blots, immunoprecipitations and immunofluorescence. A test sample from 
an individual is assessed for the presence of an alteration in the expression and/or an 

10 alteration in composition of the polypeptide encoded by a KChlPl nucleic acid, or for 
the presence of a particular variant encoded by a KChIP 1 nucleic acid. An alteration 
in expression of a polypeptide encoded by a KChlPl nucleic acid can be, for example, 
an alteration in the quantitative polypeptide expression (i.e., the amount of 
polypeptide produced); an alteration in the composition of a polypeptide encoded by a 

15 KChlPl nucleic acid is an alteration in the qualitative polypeptide expression (e.g., 
expression of an altered KChlPl polypeptide or of a different splicing variant). In a 
preferred embodiment, diagnosis of the disease or condition associated with KChlPl 
nucleic acid or a susceptibility to a disease or condition associated with a KChlPl 
nucleic acid is made by detecting a particular splicing variant encoded by that 

20 KChlPl nucleic acid, or a particular pattern of splicing variants. 

Both such alterations (quantitative and qualitative) can also be present. The 
term "alteration" in the polypeptide expression or composition, as used herein, refers 
to an alteration in expression or composition in a test sample, as compared with the 
expression or composition of polypeptide by a KChlPl nucleic acid in a control 

25 sample. A control sample is a sample that corresponds to the test sample (e.g., is from 
the same type of cells), and is from an individual who is not affected by a 
susceptibility to a disease or condition associated with a KChlPl nucleic acid. An 
alteration in the expression or composition of the polypeptide in the test sample, as 
compared with the control sample, is indicative of a susceptibility to a disease or 

30 condition associated with a KChlPl nucleic acid. Similarly, the presence of one or 
more different splicing variants in the test sample, or the presence of significantly 
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different amounts of different splicing variants in the test sample, as compared with 
the control sample, is indicative of a disease or condition associated with a KChlPl 
nucleic acid or a susceptibility to a disease or condition associated with a KChlPl 
nucleic acid. Various means of examining expression or composition of the 
5 polypeptide encoded by a KChlPl nucleic acid can be used, including: spectroscopy, 
colorimetry, lectrophoresis, isoelectric focusing, and immunoassays {e.g., David et 
al, U.S. Pat 4,376,110) such as immunoblotting (see also Current Protocols in 
Molecular Biology, particularly Chapter 10). For example, in one embodiment, an 
antibody capable of binding to the polypeptide {e.g. , as described above), preferably 
10 an antibody with a detectable label, can be used. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 
F(ab') 2 ) can be used. The term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or antibody by coupling {i.e., 
physically linking) a detectable substance to the probe or antibody, as well as indirect 
15 labeling of the probe or antibody by reactivity with another reagent that is directly 

labeled. Examples of indirect labeling include detection of a primary antibody using a 
fluorescently labeled secondary antibody and end-labeling of a DNA probe with 
biotin such that it can be detected with fluorescently labeled streptavidin. 

Western blotting analysis, using an antibody as described above that 
♦ 20 specifically binds to a polypeptide encoded by an altered KChJP 1 nucleic acid {e.g. , a 
KChlPl nucleic acid having one or more alterations as shown in Table 10), or an 
antibody that specifically binds to a polypeptide encoded by a non-altered nucleic 
acid, or an antibody that specifically binds to a particular splicing variant encoded by 
a nucleic acid, can be used to identify the presence in a test sample of a particular 
25 splicing variant or of a polypeptide encoded by a polymorphic or altered KChlPl 
nucleic acid, or the absence in a test sample of a particular splicing variant or of a 
polypeptide encoded by a non-polymorphic or non-altered nucleic acid. The presence 
of a polypeptide encoded by a polymorphic or altered nucleic acid, or the absence of a 
polypeptide encoded by a non-polymorphic or non-altered nucleic acid, is diagnostic 
30 for a disease or condition associated with a KChlPl nucleic acid or a susceptibility to 
a disease or condition associated with a KChlPl nucleic acid {e.g., Type II diabetes), 
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as is the presence (or absence) of particular splicing variants encoded by the KChlPl 
nucleic acid. 

In one embodiment of this method, the level or amount of polypeptide 
encoded by a KChlPl nucleic acid in a test sample is compared with the level or 
5 amount of the polypeptide encoded by the KChIP 1 in a control sample. A level or 
amount of the polypeptide in the test sample that is higher or lower than the level or 
amount of the polypeptide in the control sample, such that the difference is 
statistically significant, is indicative of an alteration in the expression of the 
polypeptide encoded by the KChlPl nucleic acid, and is diagnostic for a disease or 
10 condition associated with a KChlPl nucleic acid or a susceptibility to a disease or 
condition associated with that KChlPl nucleic acid (e.g., Type II diabetes). 
Alternatively, the composition of the polypeptide encoded by a KChlPl nucleic acid 
in a test sample is compared with the composition of the polypeptide encoded by the 
KChIP 1 nucleic acid in a control sample (e.g. , the presence of different splicing 

15 variants). A difference in the composition of the polypeptide in the test sample, as 
compared with the composition of the polypeptide in the control sample, is diagnostic 
for a disease or condition associated with a KChlPl nucleic acid or a susceptibility to 
a disease or condition associated with that KChIP 1 nucleic acid (e.g. , Type II 
diabetes). In another embodiment, both the level or amount and the composition of 

20 the polypeptide can be assessed in the test sample and in the control sample. A 

difference in the amount or level of the polypeptide in the test sample, compared to 
the control sample; a difference in composition in the test sample, compared to the 
control sample; or both a difference in the amount or level, arid a difference in. the 
composition, is indicative of a disease of condition associated with a KChlPl nucleic 

25 acid or a susceptibility to a disease or condition associated with that KChlPl nucleic 
acid. 

The invention further pertains to a method for the diagnosis or identification of 
a susceptibility to Type E diabetes in an individual, by identifying an at-risk 
haplotype (e.g., a haplofype comprising a KChlPl nucleic acid). The KChlPl- 
30 associated haplotypes, e.g., those described in Table 2 and Table 5, describe a set of 
genetic markers ("alleles"). In a certain embodiment, the haplotype can comprise one 
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or more alleles, two or more alleles, three or more alleles, four or more alleles, or five 
or more alleles. The genetic markers are particular "alleles" at "polymorphic sites" 
associated with KChlPl . A nucleotide position at which more than one sequence is 
possible in a population (either a natural population or a synthetic population, e.g., a 
5 library of synthetic molecules), is referred to herein as a "polymorphic site". Where a 
polymorphic site is a single nucleotide in length, the site is referred to as a single 
nucleotide polymorphism ("SNP"). For example, if at a particular chromosomal 
location, one member of a population has an adenine and another member of the 
population has a thymine at the same position, then this position is a polymorphic site, 

10 and, more specifically, the polymorphic site is a SNP. Polymorphic sites can allow 
for differences in sequences based on substitutions, insertions or deletions. Each 
version of the sequence with respect to the polymorphic site is referred to herein as an 
"allele" of the polymorphic site. Thus, in the previous example, the SNP allows for 
both an adenine allele and a thymine allele. 

15 Typically, a reference sequence is referred to for a particular sequence. 

Alleles that differ from the reference are referred to as "variant" alleles. For example, 
the reference KChlPl sequence is described herein by SEQ ID NO: 1 . The term, 
"variant KChlPl", as used herein/refers to a sequence that differs from SEQ ID NO: 
1, but is otherwise substantially similar. The genetic markers that make up the 

20 haplotypes described herein are KChlPl variants. The variants of KChlPl that are 
used to determine the haplotypes disclosed herein of the present invention are 
associated with Type II diabetes or a susceptibility to Type II diabetes. 

Additional variants can include changes that affect a polypeptide, e.g., the 
KChlPl polypeptide. These sequence differences, when compared to a reference 

25 nucleotide sequence, can include the insertion or deletion of a single nucleotide, or of 
more than one nucleotide, resulting in a frame shift; the change of at least one 
nucleotide, resulting in a change in the encoded amino acid; the change of at least one 
nucleotide, resulting in the generation of a premature stop codon; the deletion of 
several nucleotides, resulting in a deletion of one or more amino acids encoded by the 

30 nucleotides; the insertion of one or several nucleotides, such as by unequal 

recombination or gene conversion, resulting in an interruption of the coding sequence 
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of a reading frame; duplication of all or apart of a sequence; transposition; or a 
rearrangement of a nucleotide sequence, as described in detail above. Such sequence 
changes alter the polypeptide encoded by a KChJPl nucleic acid. For example, if the 
change in the nucleic acid sequence causes a frame shift, the frame shift can result in 
5 a change in the encoded amino acids, and/or can result in the generation of a 

premature stop codon, causing generation of a truncated polypeptide. Alternatively, a 
polymorphism associated with Type H diabetes or a susceptibility to Type H diabetes 
can be a synonymous change in one or more nucleotides {i.e., a change that does not 
result in a change in the amino acid sequence). Such a polymorphism can, for 

10 example, alter splice sites, affect the stability or transport of mRNA, or otherwise 
affect the transcription or translation of the polypeptide. The polypeptide encoded by 
the reference nucleotide sequence is the "reference" polypeptide with a particular 
reference amino acid sequence, and polypeptides encoded by variant alleles are 
referred to as "variant" polypeptides with variant amino acid sequences. 

15 Haplotypes are a combination of genetic markers, e.g,, particular alleles at 

polymorphic sites. The haplotypes described herein, e.g., having markers such as 
those shown in Table 10, Table 1 1, Table 12 or Table 13, are found more frequently 
in individuals with Type II diabetes than in individuals without Type H diabetes. 
Therefore, these haplotypes have predictive value for detecting Type H diabetes or a 
20 susceptibility to Type E diabetes in an individual. The haplotypes described herein 
are a combination of various genetic markers, e.g., SNPs and microsatellites. 
Therefore, detecting haplotypes can be accomplished by methods known in the art for 
detecting sequences at polymorphic sites, such as the methods described above. 

25 HAPLOTYPE SCREENING 

In the methods for the diagnosis and identification of susceptibility to Type II 
diabetes or Type H diabetes in an individual, an at-risk haplotype is identified. In one 
embodiment, the at-risk haplotype is one which confers a significant risk of Type H 
diabetes. In one embodiment, significance associated with a haplotype is measured 

30 by an odds ratio. In a further embodiment, the significance is measured by a 

percentage. In one embodiment, a significant risk is measured as an odds ratio of at 
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least about 1.2, including by not limited to: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, and 1.9. In 
a further embodiment, an odds ratio of at least 1.2 is significant. In a further 
embodiment, an odds ratio of at least about 1.5 is significant. In a further 
embodiment, a significant increase in risk is at least about 1.7 is significant. In a 
5 further embodiment, a significant increase in risk is at least about 20%, including but 
not limited to about 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 
80%, 85%, 90%, 95% and 98%. In a further embodiment, a significant increase in 
risk is at least about 50%. It is understood however, that identifying whether a risk is 
medically significant may also depend on a variety of factors, including the specific 

10 disease, the haplotype, and often, environmental factors. 

The invention also pertains to methods of diagnosing Type II diabetes or a 
susceptibility to Type II diabetes in an individual, comprising screening for an at-risk 
haplotype in, or comprising portions of, the KChlPlgene, where the haplotype is 
more frequently present in an individual susceptible to Type II diabetes (affected), 

15 compared to the frequency of its presence in a healthy individual (control), and 
wherein the presence of the haplotype is indicative of Type II diabetes or 
susceptibility to Type II diabetes. Standard techniques for genotyping for the 
presence of SNPs and/or microsatellite markers can be used, such as fluorescent based 
techniques (Chen, et al. 9 Genome Res, 9, 492 (1999)), PGR, LCR, Nested PCR and 

20 other techniques for nucleic acid amplification. In a preferred embodiment, the 
method comprises assessing in an individual the presence or frequency of SNPs 
and/or microsatellites in, comprising portions of, the KChlPlgene, wherein an excess 
or higher frequency of the SNPs and/or microsatellites compared to a healthy control 
individual is indicative that the individual has Type II diabetes or is susceptible to 

25 Type II diabetes. See, for example, Tables 6, 7, 9, 1 1 and 13 (below) for SNPs and 
markers that can form haplotypes that can be used as screening tools. These markers 
and SNPs can be used to design diagnostic tests for determining Type II diabetes or a 
susceptibility to Type II diabetes. For example, an at-risk haplotype can include 
microsatellite markers and/or SNPs such as those set forth in Table 10, Table 11, 

30 Table 12 and/ or Table 13. The presence of the haplotype is diagnostic of Type II 
diabetes or of a susceptibility to Type II diabetes. Haplotype analysis involves 
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defining a candidate susceptibility locus using LOD scores. The defined regions are 
then ultra-fine mapped with microsatellite markers with an average spacing between 
markers of less than lOOkb. All usable microsatellite markers that found in public 
databases and mapped within that region can be used. In addition, microsatellite 
5 markers identified within the deCODE genetics sequence assembly of the human 
genome can be used. 

The frequencies of haplotypes in the patient and the control groups using an 
expectation-maximization algorithm can be estimated (Dempster A. et aL, 1977. J. R 
Stat Soc. B, 39:1-389). An implementation of this algorithm that can handle missing 

1 0 genotypes and uncertainty with the phase can be used. Under the null hypothesis, the 
patients and the controls are assumed to have identical frequencies. Using a 
likelihood approach, an alternative hypothesis where a candidate at-risk-haplotype, 
which can include the markers described herein, is allowed to have a higher frequency 
in patients than controls, while the ratios of the frequencies of other haplotypes are 

15 assumed to be the same in both groups is tested. Likelihoods are maximized 

separately under both hypotheses and a corresponding 1-df likelihood ratio statistics is 
used to evaluate the statistic significance. 

To look for at-risk-haplotypes in the 1-lod drop, for example, association of all 
possible combinations of genotyped markers is studied, provided those markers span a 

20 practical region. The combined patient and control groups can be randomly divided 
into two sets, equal in size to the original group of patients and controls. The 
hapldtype analysis is then repeated and the most significant p-value registered is 
determined. This randomization scheme can be repeated, for example, over 100 times 
to construct an empirical distribution of p-values. 

25 The at-risk haplotypes identified in Table 2 (haplotypes identified as Al, A2, 

A3, A4, A5, A6, Bl, B2, B3, B4 and B5) or Table 5 (haplotypes identified as D1,D2, 
D3, D4 and D5) are associated with Type II diabetes or a susceptibility to Type II 
diabetes. In certain embodiments, a haplotype associated with Type II diabetes or a 
susceptibility to Type II diabetes comprises markers DG5S879, DG5S881, D5S2075, 

30 DG5S883 and DG5S38 at the 5q35 locus; or DG5S1058 and DG5S37 at the 5q35 
locus; or DG5S1058, DG5S37 and DG5S101 at the 5q35 locus; or DG5S881, 
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DG5S1058, D5S2075, DG5S883 and DG5S3S at the 5q35 locus; or DG5S879, 
DG5S1058and DG5S37; or DG5S881, D5S2075, DG5S883 and DG5S38atthe 
5q35 locus; DG5S953, DG5S955, DG5S13 and DG5S959 at the 5q35 locus; or 
DG5S888 and DG5S953 at the 5q35 locus; or DG5S953, DG5S955 and DG5S124 at 
5 the 5q35 locus; or DG5S888, DG5S44 and DG5S953 at the 5q35 locus; or DG5S953, 
DQ5S955, DG5S13, DG5S123, and DG5S959 at the 5q35 locus. The presence of the 
haplotype is diagnostic of Type II diabetes or of a susceptibility to Type n diabetes. 
Also described herein is a haplotype associated with Type II diabetes or a 
susceptibility to Type II diabetes comprising markers DG5S13, KCP_1 152, and 
D5S625 at the 5q35 locus; the presence of the haplotype is diagnostic of Type II 
diabetes or of a susceptibility to Type II diabetes, in one particular embodiment, the 
presence of the -4, 1, 0 haplotype at DG5S13, KCP J. 152, and D5S625 is diagnostic 
of Type II diabetes or of a susceptibility to Type n diabetes. In another embodiment, 
a haplotype associated with Type II diabetes or a susceptibility to Type II diabetes in 
an individual, comprises markers DG5S124, KCP__1 152, KCPJ2649, KPC_4976 and 
KPC-16152 at the 5q35 locus. In one particular embodiment, the presence of the 0, 1, 
1, 3 and 0 haplotype at DG5S124, KCP_1152, KCP_2649, KPC_4976 and KPC- 
16152 is diagnostic of Type II diabetes or of a susceptibility to Type II diabetes. In 
another embodiment, a haplotype associated with Type II diabetes or a susceptibility 
to Type II diabetes in an individual, comprises markers KCP__1 73982, KCP_1 5400, 
and KCP_18069. In one particular embodiment, the presence of the 0, 1, 1 haplotype 
at KCPJL 73982, KCP_15400, and KCP.J8069 is diagnostic of Type H diabetes or of 
a susceptibility to Type II diabetes. 

In additional embodiments, a haplotype associated with Type II diabetes or a 
susceptibility to Type II diabetes comprises markers DG5S 124, KCP_1 1 52, 
KCP_2649, KCP_4976, and KCP_16152 at the 5q35 locus, as well as one of the 
following 3 markers: KCP_197678, KCP_197775, and KCP_202795 at the 5q35 
locus; the presence of the haplotype is diagnostic of Type II diabetes or of a 
susceptibility to Type II diabetes. In particular embodiments, the presence of the 0, 3, 
1, 1, 3, 0 haplotype at DG5S 124, KCPJ97679, KCP_U52, KCP_2649, KCP_4976, 
and KCP_16152; the presence of the 0, 3, 1, 1, 3, 0 haplotype at DG5S124, 
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KCP_1 97775, KCP_1 152, KCP_2649, KCP 4976, and KOM6152; or the presence 
of the 0, 1,1, l,3,0haplotypeatDG5S124,KCP_202795,KCP_1152,KCP__2649, 
KCP_4976, and KCP_1 6152; is diagnostic of Type II diabetes or of a susceptibility to 
Type II diabetes. 

5 Kits (e.g., reagent kits) useful in the methods of diagnosis comprise 

components useful in any of the methods described herein, including for example, 
hybridization probes or primers as described herein (e.g., labeled probes or primers), 
reagents for detection of labeled molecules, restriction enzymes (e.g., for RFLP 
analysis), allele-specific oligonucleotides, antibodies which bind to altered or to non- 
10 altered (native) KChlPl polypeptide, means for amplification of nucleic acids 

comprising a KChlPl nucleic acid, or means for analyzing the nucleic acid sequence 
of a KChlPl nucleic acid or for analyzing the amino acid sequence of a KChlPl 
polypeptide as described herein, etc. In one embodiment, the kit for diagnosing a 
Type II diabetes or a susceptibility to Type II diabetes can comprise primers for 
15 nucleic acid amplification of a region in the KChlPl nucleic acid comprising an at- 
risk haplotype that is more frequently present in an individual having Type II diabetes 
or who is susceptible to Type II diabetes. The primers can be designed using portions 
of the nucleic acids flanking SNPs that are indicative of Type II diabetes. In a certain 
embodiment, the primers are designed to amplify regions of the KChlPl gene 
20 associated with an at-risk haplotype for Type II diabetes, as shown in Table 1 0 and 
13, or more particularly the haplotypes described in Tables 2 and 5. 

SCREENING ASSAYS AND AGENTS IDENTIFIED THEREBY 

The invention provides methods (also referred to herein as "screening assays") 

25 for identifying the presence of a nucleotide that hybridizes to a nucleic acid of the 
invention, as well as for identifying the presence of a polypeptide encoded by a 
nucleic acid of the invention. In one embodiment, the presence (or absence) of a 
nucleic acid molecule of interest (e.g., a nucleic acid that has significant homology 
with a nucleic acid of the invention) in a sample can be assessed by contacting the 

30 sample with a nucleic acid comprising a nucleic acid of the invention (e.g., a nucleic 
acid having the sequence of one of SEQ ID NOs: 1 , 1 14-258, or the complement 
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thereof, or a nucleic acid encoding an amino acid having the sequence of one of SEQ 
ID NOs: 2, or a fragment or variant of such nucleic acids), under stringent conditions 
as described above, and then assessing the sample for the presence (or absence) of 
hybridization. In one embodiment, high stringency conditions are conditions 
5 appropriate for selective hybridization. In another embodiment, a sample containing 
the nucleic acid molecule of interest is contacted with a nucleic acid containing a 
contiguous nucleotide sequence (e.g., a primer or a probe as described above) that is 
at least partially complementary to a part of the nucleic acid molecule of interest (e.g., 
a KChlPl nucleic acid), and the contacted sample is assessed for the presence or 

10 absence of hybridization. In another embodiment, the nucleic acid containing; a 

contiguous nucleotide sequence is completely complementary to apart of the nucleic 
acid molecule of interest. 

In any of these embodiments, all or a portion of the nucleic acid of interest can 
be subjected to amplification prior to performing the hybridization. 

15 In another embodiment, the presence (or absence) of a polypeptide of interest, 

such as a polypeptide of the invention or a fragment or variant thereof, in a sample 
can be assessed by contacting the sample with an antibody that specifically hybridizes 
to the polypeptide of interest {e.g., an antibody such as those described above), and 
then assessing the sample for the presence (or absence) of binding of the antibody to 

20 the polypeptide of interest. 

In another embodiment, the invention provides methods for identifying agents 
(e.g., fusion proteins, polypeptides, peptidomimetics, prodrugs, receptors, binding 
agents, antibodies, small molecules or other drugs, or ribozymes) which alter {e.g., 
increase or decrease) the activity of the polypeptides described herein, or which 

25 otherwise interact with the polypeptides herein. For example, such agents can be 
agents which bind to polypeptides described herein {e.g., KChlPl binding agents); 
which have a stimulatory or inhibitory effect on, for example, activity of polypeptides 
of the invention; or which change (e.g., enhance or inhibit) the ability of the 
polypeptides of the invention to interact with KChlPl binding agents (e.g., receptors 

30 or other binding agents); or which alter posttranslational processing of the KChlPl 
polypeptide (e.g., agents that alter proteolytic processing to direct the polypeptide 
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from where it is normally synthesized to another location in the cell, such as the cell 
surface; agents that alter proteolytic processing such that more polypeptide is released 
from the cell, etc. 

In one embodiment, the invention provides assays for screening candidate or 
5 test agents that bind to or modulate the activity of polypeptides described herein (or 
biologically active portion(s) thereof), as well as agents identifiable by the assays. 
Test agents can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 

10 deconvolution; the 'one-bead one-compound' library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to polypeptide libraries, while the other four approaches are applicable to 
polypeptide, non-peptide oligomer or small molecule libraries of compounds (Lam, 
K.S., Anticancer Di-ug Des. 12:145(1997)). 

15 In one embodiment, to identify agents which alter the activity of a KChlPl 

polypeptide, a cell, cell lysate, or solution containing or expressing a KChlPl 
polypeptide, or another splicing variant encoded by a KChlPl gene (such as 
~ comprising a SNP as shown in Table 10 and/or 3), or a fragment or derivative thereof 

(as'described above), can be contacted with an agent to be tested; alternatively, the 

20 polypeptide can be contacted directly with the agent to be tested. The level (amount) 
of KChlPl activity is assessed (e.g., the level (amount) of KChlPl activity is 
measured, either directly or indirectly), and is compared with the level of activity in a 
control (z.e., the level of activity of the KChlPl polypeptide or active fragment or 
derivative thereof in the absence of the agent to be tested). If the level of the activity 

25 in the presence of the agent differs, by an amount that is statistically significant, from 
the level of the activity in the absence of the agent, then the agent is an agent that 
alters the activity of a KChlPl polypeptide. An increase in the level of KChlPl 
activity relative to a control, indicates that the agent is an agent that enhances (is an 
agonist of) KChlPl activity. Similarly, a decrease in the level of KChlPl activity 

30 relative to a control, indicates that the agent is an agent that inhibits (is an antagonist 
of) KChlPl activity. In another embodiment, the level of activity of a KChlPl 
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polypeptide or derivative or fragment thereof in the presence of the agent to be tested, 
is compared with a control level that has previously been established. A level of the 
activity in the presence of the agent that differs from the control level by an amount 
that is statistically significant indicates that the agent alters KChlPl activity. 
5 The present invention also relates to an assay for identifying agents which 

alter the expression of a KChlPl nucleic acid (e.g., antisense nucleic acids, fusion 
proteins, polypeptides, peptidomimetics, prodrugs, receptors, binding agents, 
antibodies, small molecules or other drugs, or ribozymes) which alter (e.g., increase 
or decrease) expression (e.g., transcription or translation) of the gene or which 
10 otherwise interact with the nucleic acids described herein, as well as agents 

identifiable by the assays. For example, a solution containing a nucleic acid encoding 
a KChlPl polypeptide (e.g., a KChlPl gene or nucleic acid) can be contacted with an 
agent to be tested. The solution can comprise, for example, cells containing the 
nucleic acid or cell lysate containing the nucleic acid; alternatively, the solution can 
15 be another solution that comprises elements necessary for transcription/translation of 
the nucleic acid. Cells not suspended in solution can also be employed, if desired. 
The level and/or pattern of KChlPl expression (e.g., the level and/or pattern of 
mRNA or of protein expressed, such as the level and/or pattern of different splicing 
variants) is assessed, and is compared with the level and/pr pattern of expression in a 
20 control (i.e., the level and/or pattern of the KChlPl expression in the absence of the 
agent to be tested). If the level and/or pattern in the presence of the agent differs, by 
an amount or in a manner that is statistically significant, from the level and/or pattern 
in the absence of the agent, then the agent is an agent that alters the expression of a 
Type II diabetes gene. Enhancement of KChlPl expression indicates that the agent is 
25 an agonist of KChlPl activity. Similarly, inhibition of KChlPl expression indicates 
that the agent is an antagonist of KChlPl activity. In another embodiment, the level 
and/or pattern of KChlPl polypeptide(s) (e.g., different splicing variants) in the 
presence of the agent to be tested, is compared with a control level and/or pattern that 
have previously been established. A level and/or pattern in the presence of the agent 
30 that differs from the control level and/or pattern by an amount or in a manner that is 
statistically significant indicates that the agent alters KChIP 1 expression. 
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In another embodiment of the invention, agents which alter the expression of a 
KChlPl nucleic acid or which otherwise interact with the nucleic acids described 
herein, can be identified using a cell, cell lysate, or solution containing a nucleic acid 
encoding the promoter region of the KChlPl gene or nucleic acid operably linked to a 
5 reporter gene. After contact with an agent to be tested, the level of expression of the 
reporter gene (e.g., the level of mRNA or of protein expressed) is assessed, and is 
compared with the level of expression in a control (i.e., the level of the expression of 
the reporter gene in the absence of the agent to be tested). If the level in the presence 
of the agent differs, by an amount or in a manner that is statistically significant, from 
10 the level in the absence of the agent, then the agent is an agent that alters the . 

expression of the KChlPl, as indicated by its ability to alter expression of a gene that 
is operably linked to the KChIP 1 gene promoter. Enhancement of the expression of 
the reporter indicates that the agent is an agonist of KChlPl activity. Similarly, 
inhibition of the expression of the reporter indicates that the agent is an antagonist of 
1 5 KChJP 1 activity. In another embodiment, the level of expression of the reporter in the 
presence of the agent to be tested is compared with a control level that has previously 
been established. A level in the presence of the agent that differs from the control 
level by an amount or in a manner that is statistically significant indicates that the 
agent alters expression. 
JO Agents which alter the amounts of different splicing variants encoded by a 

KChlPl nucleic acid (e.g., an agent which enhances activity of a first splicing variant, 
and which inhibits activity of a second splicing variant), as Well as agents which are 
agonists of activity of a first splicing variant and antagonists of activity of a second 
splicing variant, can easily be identified using these methods described above. 
:5 In other embodiments of the invention, assays can be used to assess the impact 

of a test agent on the activity of a polypeptide in relation to a KCMP 1 binding agent. 
For example, a cell that expresses a compound that interacts with a KChlPl 
polypeptide (herein referred to as a "KChlPl binding agent", which can be a 
polypeptide or other molecule that interacts with a KChlPl polypeptide, such as a 
0 receptor) is contacted with a KChIP 1 in the presence of a test agent, and the ability of 
the test agent to alter the interaction between the KChIP 1 and the KChIP 1 binding 
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agent is determined. Alternatively, a cell lysate or a solution containing the KChlPl 
binding agent, can be used. An agent which binds to the KChlPl or the KChlPl 
binding agent can alter the interaction by interfering with, or enhancing the ability of 
the KChlPl to bind to, associate with, or otherwise interact with the KChlPl binding 
5 agent. Determining the ability of the test agent to bind to a KChlPl nucleic acid or a 
KChlPl binding agent can be accomplished, for example, by coupling the test agent 
with a radioisotope or enzymatic label such that binding of the test agent to the 
polypeptide can be determined by detecting the labeled with 125 1, 35 S, l4 C or 3 H, either 
directly or indirectly, and the radioisotope detected by direct counting of 

10 radioemmission or by scintillation counting. Alternatively, test agents can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. It is also within the scope of this 
invention to determine the ability of a test agent to interact with the polypeptide.. 

15 without the labeling of any of the interactants, For example, a microphysiometer can 
be used to detect the interaction of a test agent with a KChIP 1 polypeptide or a 
KChlPl binding agent without the labeling of either the test agent, KChlPl 
polypeptide, or the KChlPl binding agent. McConnell, H.M. et ai 9 Sciejtce 
257:1906-1912 (1992). As used herein, a "microphysiometer" {e.g., Cytosensor™) is 

20 an analytical instrument that measures the rate at which a cell acidifies its 

environment using a light-addressable potentiometric sensor (LAPS). Changes in this 
acidification rate can be used as an indicator of the interaction between ligand and 
polypeptide. 

Thus, these receptors can be used to screen for compounds that are agonists or 
25 antagonists, for use in treating a susceptibility to a disease or condition associated 
with a KChlPl gene or nucleic acid, or for studying a susceptibility to a disease or 
condition associated with a KChlPl {e.g., Type It diabetes). Drugs could be designed 
to regulate KChlPl activation that in turn can be used to regulate signaling pathways 
and transcription events of genes downstream. 
30 Ih another embodiment of the invention, assays can be used to identify 

polypeptides that interact with one or more KChlPl polypeptides, as described herein. 
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For example, a yeast two-hybrid system such as that described by Fields and Song 
(Fields, S. and Song, O., Nature 340:245-246 (1989)) can be used to identify 
polypeptides that interact with one or more KChlPl polypeptides. In such a yeast 
two-hybrid system, vectors are constructed based on the flexibility of a transcription 
5 factor that has two functional domains (a DNA binding domain and a transcription 
activation domain). If the two domains are separated but fused to two different 
proteins that interact with one another, transcriptional activation can be achieved, and 
transcription of specific markers (e.g., nutritional markers such as His and Ade, or 
color markers such as lacZ) can be used to identify the presence of interaction and 

10 transcriptional activation. For example, in the methods of the invention, a first vector 
is used which includes a nucleic acid encoding a DNA binding domain and also a 
KChlPl polypeptide, splicing variant, or fragment or derivative thereof, and a second 
vector is used which includes a nucleic acid encoding a transcription activation 
domain and also a nucleic acid encoding a polypeptide which potentially may interact 

15 with the KChlPl polypeptide, splicing variant, or fragment or derivative thereof (e.g\ 
a KChIP 1 polypeptide binding agent or receiptor). Incubation of yeast containing the 
first vector and the second vector under appropriate conditions (e.g. , mating 
conditions such as used in the Matchmaker™ system from Clontech (Palo Alto, 
California, USA)) allows identification of colonies that express the markers of 

20 interest. These colonies can be examined to identify the polypeptide(s) that interact 
with the KChlPl polypeptide or fragment or derivative thereof. Such polypeptides 
may be useful as agents that alter the activity of expression of a KChlPl polypeptide, 
as described above. 

In more than one embodiment of the above assay methods of the present 

25 invention, it may be desirable to immobilize either the KChlPl gene or nucleic acid, 
the KChlPl polypeptide, the KChlPl binding agent, or other components of the assay 
on a solid support, in order to facilitate separation of complexed from uncomplexed 
forms of one or both of the polypeptides, as well as to accommodate automation of 
the assay. Binding of a test agent to the polypeptide, or interaction of the polypeptide 

30 with a binding agent in the presence and absence of a test agent, can be accomplished 
in any vessel suitable for containing the reactants. Examples of such vessels include 
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inicrotitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 
protein {e.g., a glutathione-S-transferase fusion protein) can be provided which adds a 
domain that allows a KChlPl nucleic acid, KChlPl polypeptide, or a KChlPl binding 
agent to be bound to a matrix or other solid support. 
5 In another embodiment, modulators of expression of nucleic acid molecules of 

the invention are identified in a method wherein a cell, cell lysate, or solution 
containing a KChlPl nucleic acid is contacted with a test agent and the expression of 
appropriate mRNA or polypeptide (e.g., splicing variants)) in the cell, cell lysate, or 
solution, is determined. The level of expression of appropriate mRNA or 

10 polypeptide(s) in the presence of the test agent is compared to the level of expression 
of mRNA or polypeptide(s) in the absence of the test agent. The test agent can then 
be identified as a modulator of expression based on this comparison. For example, 
when expression of mRNA or polypeptide is greater (statistically significantly 
greater) in the presence of the test agent than in its absence, the test agent is identified 

15 as a stimulator or enhancer of the mRNA or polypeptide expression. Alternatively, 
when expression of the mRNA or polypeptide is less (statistically significantly less) in 
the presence of the test agent than in its absence, the test agent is identified as an 
inhibitor of the mRNA or polypeptide expression. The level of mRNA or polypeptide 
expression in the cells can be determined by methods described herein for detecting 

20 mRNA or polypeptide. 

This invention further pertains to hovel agents identified by the above- 
described screening assays. Accordingly, it is within the scope of this invention to 
further use an agent identified as described herein in an appropriate animal model. 
For example, an agent identified as described herein (e.g., a test agent that is a 

25 modulating agent, an antisense nucleic acid molecule, a specific antibody, or a 

polypeptide-binding agent) can be used in an animal model to determine the efficacy, 
toxicity, or side effects of treatment with such an agent. Alternatively, an agent 
identified as described herein can be used in an animal model to determine the 
mechanism of action of such an agent 

30 Furthermore, this invention pertains to uses of novel agents identified by the 

above-described screening assays for treatments as described herein. In addition, an 
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agent identified as described herein can be used to alter activity of a polypeptide 
encoded by a KChlPl nucleic acid, or to alter expression of a KCMP1 nucleic acid, 
by contacting the polypeptide or the nucleic acid (or contacting a cell comprising the 
polypeptide or the nucleic acid) with the agent identified as described herein. 
5 * 

PHARMACEUTICAL COMPOSITIONS * 

The present invention also pertains to pharmaceutical compositions 
comprising nucleic acids described herein, particularly nucleotides encoding the 
polypeptides described herein (e.g., aKCbJPl polypeptide); comprising polypeptides 

10 described herein and/or comprising other splicing variants encoded by a KChlPl 
nucleic acid; and/or an agent that alters (eg., enhances or inhibits) KChlPl nucleic 
acid expression or KChlPl polypeptide activity as described herein. For instance, a 
polypeptide, protein (e.g., a KChlPl nucleic acid receptor), an agent that alters 
KChlPl nucleic acid expression, or a KChlPl binding agent or binding partner, 

15 fragment, fusion protein or pro-drug thereof, or a nucleotide or nucleic acid construct 
(vector) comprising a nucleotide of the present invention, or an agent that alters 
KChlPl polypeptide activity, can be formulated with a physiologically acceptable 
carrier or excipient to prepare a pharmaceutical composition. The carrier and 
composition can be sterile. The formulation should suit the mode of administration. 

20 Suitable pharmaceutically acceptable carriers include but are not limited to 

water, salt solutions (e.g., NaCI), saline, buffered saline, alcohols, glycerol, ethanol, 
gum arabic, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin, 
carbohydrates such as lactose, amylose or starch, dextrose, magnesium stearate, talc, 
silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxymethylcellulose, 

25 polyvinyl pyrolidone, etc., as well as combinations thereof The pharmaceutical 
preparations can, if desired, be mixed with auxiliary agents, e.g., lubricants, 
preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic 
pressure, buffers, coloring, flavoring and/or aromatic substances and the like which do 
not deleteriously react with the active agents. 

30 The composition, if desired, can also contain minor amounts of wetting or 

emulsifying agents, or pH buffering agents. The composition can be a liquid solution, 
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suspension, emulsion, tablet, pill, capsule, sustained release formulation, or powder. 
The composition can be formulated as a suppository, with traditional binders and 
carriers such as triglycerides. Oral formulation can include standard carriers such as 
pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, polyvinyl 
5 pyrollidone, sodium saccharine, cellulose, magnesium carbonate, etc. 

Methods of introduction of these compositions include, but are not limited to, 
intradermal, intramuscular, intraperitoneal, intraocular, intravenous, subcutaneous, 
topical, oral and intranasal. Other suitable methods of introduction can also include 
gene therapy (as described below), rechargeable or biodegradable devices, particle 

10 acceleration devises ("gene guns") and slow release polymeric devices. The 

pharmaceutical compositions of this invention can also be administered as part of a 
combinatorial therapy with other agents. 

The composition can be formulated in accordance with the routine procedures 
as a pharmaceutical composition adapted for administration to human beings. For 

15 example, compositions for intravenous administration typically are solutions in sterile 
isotonic aqueous buffer. Where necessary, the composition may also include a 
solubilizing agent and a local anesthetic to ease pain at the site of the injection. 
Generally, the ingredients are supplied either separately or mixed together in unit 
dosage form, for example, as a dry lyophilized powder or water free concentrate in a 

20 hermetically sealed container such as an ampule or sachette indicating the quantity of 
active agent. Where the composition is to be administered by infusion, it can be 
dispensed with an infusion bottle containing sterile pharmaceutical grade water, saline 
or dextrose/water. Where the composition is administered by injection, an ampule of 
sterile water for injection or saline can be provided so that the ingredients may be 

25 mixed prior to administration. 

For topical application, nonsprayable forms, viscous to semi-solid or solid 
forms comprising a carrier compatible with topical application and having a dynamic 
viscosity preferably greater than water, can be employed. Suitable formulations 
include but are not limited to solutions, suspensions, emulsions, creams, ointments, 

30 powders, enemas, lotions, sols, liniments, salves, aerosols, etc., which are, if desired, 
sterilized or mixed with auxiliary agents, e.g., preservatives, stabilizers, wetting 
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agents, buffers or salts for influencing osmotic pressure, etc. The agent may be 
incorporated into a cosmetic formulation. For topical application, also suitable are 
sprayable aerosol preparations wherein the active ingredient, preferably in 
combination with a solid or liquid inert carrier material, is packaged in a squeeze 
5 bottle or in admixture with a pressurized volatile, normally gaseous propellant, e.g., 
pressurized air. 

Agents described herein can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amino groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and 

10 those formed with free carboxyl groups such as those derived from sodium, 

potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 2- 
ethylamino ethanol, histidirie, procaine, etc. 

The agents are administered in a therapeutically effective amount. The 
amount of agents which will be therapeutically effective in the treatment of a 

15 particular disorder or condition will depend on the nature of the disorder or condition, 
and can be determined by standard clinical techniques. In addition, in vitro or in vivo 
assays may optionally be employed to help identify optimal dosage ranges. The 
precise dose to be employed in the formulation will also depend on the route of 
administration, and the seriousness of the symptoms, and should be decided according 

20 to the judgment of a practitioner and each patient's circumstances. Effective doses 
may be extrapolated from dose-response curves derived from in vitro or animal model 
test systems. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 

25 compositions of the invention. Optionally associated with such coritainer(s) can be a 
notice in the form prescribed by a governmental agency regulating the manufacture, 
use or sale of pharmaceuticals or biological products, which notice reflects approval 
by the agency of manufacture, use of sale for human administration. The pack or kit 
can be labeled with information regarding mode of administration, sequence of drug 

30 administration {e.g., separately, sequentially or concurrently), or the like. The pack or 
kit may also include means for reminding the patient to take the therapy. The pack or 
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lcit can be a single unit dosage of the combination therapy or it can be a plurality of 
unit dosages. In particular, the agents can be separated, mixed together in any 
combination, present in a single vial or tablet. Agents assembled in a blister pack or 
other dispensing means is preferred. For the purpose of this invention, unit dosage is 
5 intended to mean a dosage that is dependent on the individual pharmacodynamics of 
each agent and administered in FDA approved dosages in standard time courses. 

METHODS OF THERAPY 

The present invention also pertains to methods of treatment (prophylactic 
10 and/br therapeutic) for certain diseases and conditions associated with KChIP 1. In 
particular, the invention relates to methods of treatment for Type II diabetes or a 
susceptibility to Type II diabetes, using a Type II diabetes therapeutic agent. A "Type 
H diabetes therapeutic agent'* is an agent that alters (e.g., enhances or inhibits) 
KChTPl polypeptide activity and/or KCWPl nucleic acid expression, as described 
15 herein (e.g. , a Type II diabetes nucleic acid agonist or antagonist), hi certain 

embodiments, the Type II diabetes therapeutic agent alters activity and/or nucleic acid 
expression of KChlPl. 

Type II diabetes therapeutic agents can alter KChlPl polypeptide activity or 
nucleic acid expression by a variety of means, such as, for example, by providing 
20 additional KChlPl polypeptide or by upregulating the transcription or translation of 
the KChIP 1 nucleic acid; by altering posttranslational processing of the KChIP 1 
polypeptide; by altering transcription of KChlPl splicing variants; or by interfering 
with KChIP 1 polypeptide activity (e.g. , by binding to a KChIP 1 polypeptide), or by 
binding to another polypeptide that interacts with KChIP 1 , by altering (e.g., 
25 downregulating) the expression, transcription or translation of a KChlPl nucleic acid, 
or by altering (e.g., agonizing or antagonizing) activity, 

Representative Type II diabetes therapeutic agents include the following: 

nucleic acids or fragments or derivatives thereof described herein, particularly 
30 nucleotides encoding the polypeptides described herein and vectors 

comprising such nucleic acids (e.g., a gene, cDNA, and/or mRNA, such as a 
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nucleic acid encoding a KChlPl polypeptide or active fragment or derivative 
thereof, or an oligonucleotide; or a complement thereof, or fragments or 
derivatives thereof, and/or other splicing variants encoded by a Type n 
diabetes nucleic acid, or fragments or derivatives thereof); 

5 

polypeptides described herein and/ or splicing variants encoded by the 
KChlPl nucleic acid or fragments or derivatives thereof; 

other polypeptides (e.g., KChlPl receptors); KChlPl binding agents; or agents 
10 that affect (e.g., increase or decrease) activity, 

antibodies, such as an antibody to an altered KChlPl polypeptide, or an 
antibody to a non-altered KChlPl polypeptide, or an antibody to a particular 
splicing variant encoded by a KChlPl nucleic acid as described above; 

peptidomimetics; fusion proteins or prodrugs thereof; ribozymes; other small 
molecules; and 

other agents that alter (e.g., enhance or inhibit) expression of a KChlPl 
20 nucleic acid, or that regulate transcription of KChlPl splicing variants (e.g., 

agents that affect which splicing variants are expressed, or that affect the 
amount of each splicing variant that is expressed). 

More than one Type IT diabetes therapeutic agent can be used concurrently, if desired. 

25 A Type II diabetes nucleic acid therapeutic agent that is a nucleic acid is used 

in the treatment of Type II diabetes or in the treatment for a susceptibility to Type II 
diabetes. The term, "treatment" as used herein, refers not only to ameliorating 
symptoms associated with the disease or condition, but also preventing or delaying 
the onset of the disease or condition, and also lessening the severity or frequency of 

30 symptoms of the disease or condition. The therapy is designed to alter (e.g., inhibit or 
enhance), replace or supplement activity of a KChlPl polypeptide in an individual. 
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For example, a Type II diabetes therapeutic agent can be administered in order to 
upregulate or increase the expression or availability of the KChlPl nucleic acid or of 
specific splicing variants of KChlPl nucleic acid, or, conversely, to downreguiate or 
decrease the expression or availability of the KChlPl nucleic acid or specific splicing 
5 variants of the KChlPl nucleic acid. Upregulation or increasing expression or 

availability of a native KChlPl gene or nucleic acid or of a particular splicing variant 
could interfere with or compensate for the expression or activity of a defective gene or 
another splicing variant; downregulation or decreasing expression or availability of a 
native KChlPl gene or of a particular splicing variant could minimize the expression 

10 or activity of a defective gene or the particular splicing variant and thereby minimize 
the impact of the defective gene or the particular splicing variant, i 

The Type II diabetes therapeutic agent(s) are administered in a therapeutically 
effective amount (i.e., an amount that is sufficient to treat the disease, such as by 
ameliorating symptoms associated with the disease, preventing or delaying the onset 

15 of the disease, and/or also lessening the severity or frequency of symptoms of the 
disease). The amount which will be therapeutically effective in the treatment of a 
particular individual's disorder or condition will depend on the symptoms and 
severity of the disease, and can be determined by standard clinical techniques. In 
addition, in vitro or in vivo assays may optionally be employed to help identify 

20 optimal dosage ranges. The precise dose to be employed in the formulation will also 
depend on the route of administration, and the seriousness of the disease or disorder, 
and should be decided according to the judgment of a practitioner and each patient's 
circumstances. Effective doses may be extrapolated from dose-response curves 
derived from in vitro or animal model test systems. 

25 In one embodiment, a nucleic acid of the invention {e.g., a nucleic acid 

encoding a KChlPl polypeptide, such as one of SEQ ID NO: 1 or a complement 
thereof); or another nucleic acid that encodes a KChlPl polypeptide or a splicing 
variant, derivative or fragment thereof (e.g., comprising any one or more of SEQ ID 
NO: 1 14-258), can be used, either alone or in a pharmaceutical composition as 

30 described above. For example, a KChlPl gene or nucleic acid or a cDNA encoding a 
KChlPl polypeptide, either by itself or included within a vector, can be introduced 
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into cells (either in vitro or in vivo) such that the cells produce native KChlPl 
polypeptide. If necessary, cells that have been transformed with the gene or cDNA or 
a vector comprising the gene, nucleic acid or cDNA can be introduced (or re- 
introduced) into an individual affected with the disease. Thus, cells which, in nature, 
5 lack native KChlPl expression and activity, or have altered KChlPl expression and 
activity, or have expression of a disease-associated KChlPl splicing variant, can be 
engineered to express the KChlPl polypeptide or an active fragment of the KChlPl 
polypeptide (or a different variant of the KChlPl polypeptide). In certain 
embodiments, nucleic acids encoding a KChlPl polypeptide, or an active fragment or 
10 derivative thereof, can be introduced into an expression vector, such as a viral vector, 
and the vector can be introduced into appropriate cells in an animal. Other gene 
transfer systems, including viral and nonviiral transfer systems, can be used. 
Alternatively, nonviral gene transfer methods, such as calcium phosphate 
coprecipitation, mechanical techniques (e.g., microinjection); membrane fiision- 
15 mediated transfer via liposomes; or direct DNA uptake, can also be used. 

Alternatively, in another embodiment of the invention, a nucleic acid of the 
invention; a nucleic acid complementary to a nucleic acid of the invention; or a 
portion of such a nucleic acid {e.g., an oligonucleotide as described below), can be 
used in "antisense" therapy, in which a nucleic acid (e.g., an oligonucleotide) which 
20 specifically hybridizes to the mRNA and/or genomic DNA of a Type II diabetes gene 
is administered or generated in situ. The antisense nucleic acid that specifically 
hybridizes to the mRNA and/or DNA inhibits expression of the KChlPl polypeptide, 
e.g., by inhibiting translation and/or transcription. Binding of the antisense nucleic 
acid can be by conventional base pair complementarity, or, for example, in the case of 
25 binding to DNA duplexes, through specific interaction in the major groove of the 
double helix. 

An antisense construct of the present invention can be delivered, for example, 
as an expression plasmid as described above. When the plasmid is transcribed in the 
cell, it produces RNA that is complementary to a portion of the mRNA and/or DNA 
30 which encodes the KChlPl polypeptide. Alternatively, the antisense construct can be 
an oligonucleotide probe that is generated ex vivo and introduced into cells; it then 



2004041 1 93A2_I_> 



WO 2004/041193 



-79- 



PCT/US2003/034681 



inhibits expression by hybridizing with the mRNA and/or genomic DNA of the 
polypeptide. In one embodiment, the oligonucleotide probes are modified 
oligonucleotides, which are resistant to endogenous nucleases, e.g., exonucleases 
and/or endonucleases, thereby rendering them stable in vivo. Exemplary nucleic acid 
5 molecules for use as antisense oligonucleotides are phosphoramidate, phosphothioate 
and methylphosphonate analogs of DNA (see also U.S. Pat. Nos. 5,176,996; 
5,264,564; and 5,256,775). Additionally, general approaches to constructing 
oligomers useful in antisense therapy are also described, for example, by Van der 
Krol et al, (BioTechniques 6:958-976 (1988)); and Stein et aL, (Cancer Res. 48:2659- 
10 2668 (1988)). With respect to antisense DNA, oligodeoxyribonucleotides derived 
from the translation initiation site are preferred. 

To perform antisense therapy, oligonucleotides (mRNA, cDNA or DNA) are 
designed that are complementary to mRNA encoding the KChlPL The antisense 
oligonucleotides bind to KChlPl mRNA transcripts and prevent translation. Absolute 
15 complementarity, although preferred, is not required. A sequence "complementary" 
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient 
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the 
case of double-stranded antisense nucleic acids, a single strand of the duplex DNA 
may thus be tested, or triplex formation may be assayed. The ability to hybridize will 
20 depend on both the degree of complementarity and the length of the antisense nucleic 
acid, as described in detail above. Generally, the longer the hybridizing nucleic acid, 
the more base mismatches with an RNA it may contain and still form a stable duplex 
(or triplex, as the case may be). One skilled in the art can ascertain a tolerable degree 
of mismatch by use of standard procedures. 
25 The oligonucleotides used in antisense therapy can be DNA, RNA, or 

chimeric mixtures or derivatives or modified versions thereof, single-stranded or 
double-stranded. The oligonucleotides can be modified at the base rhoiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule, 
hybridization, etc. The oHgonucleotides can include other appended groups such as 
30 peptides (e.g. for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al, Proc. Natl. Acad. Sci. USA 
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86:6553-6556 (1989); Lemaitre et al, Proc. Natl. Acad. Sci. USA 84:648-652 (1987); 
PCT International Publication NO: WO 88/09810) or the blood-brain barrier (see, 
e.g., PCT International Publication NO: WO 89/10134), or hybridization-triggered 
cleavage agents (see, e.g., Krol et ah, BioTechniques 6:958-976 (1 988)) or 
5 intercalating agents. (See, e^., Zon, Pharm. Res. 5:539-549(1988)). To this end, the 
oligonucleotide may be conjugated to another molecule {e.g. , a peptide, hybridization 
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent). 

The antisense molecules are delivered to cells that express KChlPl in vivo. A 
number of methods can be used for delivering antisense DNA or RNA to cells; e.g., 

10 antisense molecules can be injected directly into the tissue site, or modified antisense 
molecules, designed to target the desired cells (e.g., antisense linked to peptides or 
antibodies that specifically bind receptors or antigens expressed on the target cell 
surface) can be administered systematically. Alternatively, in a preferred 
embodiment, a recombinant DNA construct is utilized in which the antisense 

15 oligonucleotide is placed under the control of a strong promoter (e.g., pol IH or pol 
II). The use of such a construct to transfect target cells in the patient results in the 
transcription of sufficient amounts of single stranded RNAs that will form 
complementary base pairs with the endogenous KChlPl transcripts and thereby 
prevent translation of the KChlPl mRNA. For example, a vector can be introduced in 

20 vivo such that it is taken up by a cell and directs the transcription of an antisense 
RNA. Such a vector can remain episomal or become chromosorhally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can 
be constructed by recombinant DNA technology methods standard in the art and 
described above. For example, a plasmid, cosmid, YAC or viral vector can be used to 

25 prepare the recombinant DNA construct that can be introduced directly into the tissue 
site. Alternatively, viral vectors can be used which selectively infect the desired 
tissue, in which case administration may be accomplished by another route (e.g., 
systemically). 

Endogenous KChlPl polypeptide expression can also be reduced by 
30 inactivating or tc knocking out" the gene, nucleic acid or its promoter using targeted 
homologous recombination (e.g., see Smithies et al, Nature 317:230-23 4 (1985); 
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Thomas & Capecchi, Cell 51:503-512 (1987); Thompson et al, Cell 5:313-321 
(1989)). For example, an altered, non-functional gene or nucleic acid (or a 
completely unrelated DNA sequence) flanked by DNA homologous to the 
endogenous gene or nucleic acid (either the coding regions or regulatory regions of 
5 the nucleic acid) can be used, with or without a selectable marker and/or a negative 
selectable marker, to transfect cells that express the gene or nucleic acid in vivo. 
Insertion of the DNA construct, via targeted homologous recombination, results in 
inactivation of the gene or nucleic acid. The recombinant DNA constructs can be 
directly administered or targeted to the required site in vivo using appropriate vectors, 
10 as described above. Alternatively, expression of non-altered genes or nucleic acids 
can be increased using a similar method: targeted homologous recombination can be 
used to insert a DNA construct comprising a non-altered functional gene or nucleic 
acid, e.g., a nucleic acid comprising one or more of SEQ ID NOs: 1 14-258 or the 
complement thereof, or a portion thereof, in place of an altered KChiPl in the cell, as 
15 described above. In another embodiment, targeted homologous recombination can be 
used to insert a DNA construct comprising a nucleic acid that encodes a Type H 
diabetes polypeptide variant that differs from that present in the cell. 

Alternatively, endogenous KChIP 1 nucleic acid expression can be reduced by 
targeting deoxyribonucleotide sequences complementary to the regulatory region of a 
20 KChiPl nucleic acid (i.e., the KChiPl promoter and/or enhancers) to form triple 

helical structures that prevent transcription of the KCMP1 nucleic acid in target cells 
in the body. (See generally, Helene, ^.Anticancer DrugDes., 6(6):569-84 (1991); 
Helene, C. et al.,Ann. N.Y. Acad. Sci. 660:27-36 (1992); and Maher, L. J., Bioassays 
14(12):807-15 (1992)). Likewise, the antisense constructs described herein, by 
25 antagonizing the normal biological activity of one of the KChiPl proteins, can be 
used in the manipulation of tissue, e.g., tissue differentiation, both in vivo and for ex 
vivo tissue cultures. Furthermore, the anti-sense techniques (e.g., microinjection of 
antisense molecules, or transfection with plasmids whose transcripts are anti-sense 
with regard to a Type E diabetes gene mRNA or gene sequence) can be used to 
30 investigate the role of KChiPl or the interaction of KChiPl and its binding agents in 
developmental events, as well as the normal cellular function of KChTPl or of the 
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interaction of KCMPl and its binding agents in adult tissue. Such techniques can be 
utilized in cell culture, but can also be used in the creation of transgenic animals. 

In yet another embodiment of the invention, other Type II diabetes therapeutic 
agents as described herein can also be used in the treatment or prevention of a 
5 susceptibility to a disease or condition associated with a Type II diabetes gene. The 
therapeutic agents can be delivered in a composition, as described above, or by 
themselves. They can be administered systemically, or can be targeted to a particular 
tissue. The therapeutic agents can be produced by a variety of means, including 
chemical synthesis; recombinant production; in vivo production (e.g., a transgenic 
10 animal, such as U.S. Pat. NO: 4,873,316 to Meade et al), for example, and can be 
isolated using standard means such as those described herein. 

A combination of any of the above methods of treatment (e.g. , administration 
of non-altered polypeptide in conjunction with antisense therapy targeting altered 
mRNA of KChlPl ; administration of a first splicing variant encoded by a KChlPl 
15 nucleic acid in conjunction with antisense therapy targeting a second splicing encoded 
by a KChIP 1 nucleic acid) can also be used. 

The present invention is now illustrated by the following Exemplification, 
which is not intended to be limiting in any way. All references cited herein are 
incorporated by reference in their entirety. 

20 

EXEMPLIFICATION 

The study was done in collaboration with the Icelandic Heart Association, who 
provided an encrypted list of 1350 diabetic patients. In 1967-1991 the Heart 
Association started a study of cardiovascular disease and its complications. 

25 Measurements of blood sugar were included in a thorough check-up of the 

participants which results led to many individuals being diagnosed with diabetes. The 
list of participants is an unbiased sample of about a third of the Icelandic nation. 
Individuals diagnosed in the years following 1991 were either diagnosed at the 
Icelandic Heart Association or at one of two major hospitals in Reykjavik, Iceland. 

30 All participants in the Type II diabetes study visited the Icelandic Heart 

Association where each answered a questionnaire, had blood drawn, a blood sugar 
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assessment, and measurements taken. Height (m) and weight (leg) were measured to 
calculate the body mass index. In serum, the fasting blood glucose and triglyceride 
levels were measured as well. Diagnoses of Type n diabetes were based on the 
diagnostic criteria set by the World Health Organization (1999). All patients with 
5 fasting glucose above 7 mM were diagnosed as having Type II diabetes and 

individuals with fasting blood sugar between 6.1 - 6.9 mM were diagnosed with 
impaired fasting glucose. If the participants had no prior history of diabetes, they 
were requested to come in for another test to have their diagnosis confirmed. All 
individuals on diabetic medication were classified as Type II. The questionnaire 
10 included questions regarding age at diagnosis and type of medication. All patients 
were requested to bring two relatives who's DNA was used to confirm the 
genetotypes of the patients, 

Since the patients had participated in a study that was conducted between 
1967-1991 a considerable time had passed, in some instances, since they had visited 
15 the Heart Association. Therefore, all the patients were required to have another 
fasting blood glucose test to check on their blood sugar level at the time of 
participation in the study. Thus, all patients were labeled unconfirmed, meaning that 
results of blood glucose levels were pending, for this particular study. A label of 
confirmed diabetic was given to the patient when the measurements were received. 
20 Linkage analyses were done with confirmed patients and unconfirmed patients were 
included only if they were close relatives of a confirmed index patient. The initial list 
of patients included 1350 Type II diabetics, but during this study new patients were 
diagnosed who were relatives of the index patients. All participants with no previous 
history of diabetes but with elevated fasting glucose were diagnosed according to the 
25 WHO criteria as described above. At present date, 1406 Type II diabetics and 266 
patients with impaired fasting glucose have participated in the study, together with 
3972 of their close relatives. 

This study was approved by the Data Protection Commission of Iceland and 
the National Bioethics Committee of Iceland. All patients and their relatives who 
30 participated in the Study gave informed consents. 
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Outline of the study 

This particular genetic study, which has the aim of identifying a genetic 
variant or a gene that may contribute to type II diabetes by using a positional cloning 
approach, can be divided into three steps: 
5 u Genome-wide linkage study, where excess allele sharing among related 

type II diabetics is used to identify a chromosomal segment, typically 2 
- 8 Megabases long, that may harbor a disease susceptibility 
gene/genes. 

ii. Locus-wide association study, where a high-density of microsatellite 
10 markers is typed in a large patient and control cohort. By comparing 

the frequencies of individual alleles or haplotypes between the two 
cohorts, the location of the putative disease gene/genes is narrowed 
down to a few hundred kilobases. 

15 iii. Candidate gene assessment, where additional rnicrosatellites and/or 

SNPs are typed in all genes that are identified within the smaller 
candidate region and further association analysis is used to identify 
which of the genes shows strong association to the disease. 

20 Linkage analysis 

Pedigree Construction 

For the linkage analysis, blood samples were obtained from 964 Type II 
diabetics and 203 individuals with impaired fasting glucose. The patients were 
clustered into families such that each patient is related to (within and including six 

25 meiotic events) at least one other patient. In this manner, 772 patients fell into 

families - 705 Type II diabetics and 67 with impaired fasting glucose. The confirmed 
Type II patients were treated as probands and clustered into families that each 
proband is related to, within and including six meiotic events. The other patients, 
unconfirmed Type II and IFG patients, were added to the families if they were related 

30 to a proband within and including three meiotic events. The rational behind this was 
to include as many patients as possible in the study. Impaired fasting glucose is an 
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immediate diagnosis, and we assumed that the more closely related these patients are 
to the confirmed diabetics, the likelier they are to have or to develop the disease. 

The families were checked for relationship errors by comparing the identity-by 
state (IBS) distribution for the set of 906 markers, for each pair of related and 
5 genotyped individuals, to a reference distribution corresponding to the particular 
degree of relatedness. The reference distributions were constructed from a large 
subset of the Icelandic population. Individuals were excluded from the study if their 
relationship with the rest of the family was inconsistent with the relationship specified 
in the geneology databse. 
10 The remaining material that was available for the study was the following: 

763 now confirmed Type II patients in 227 families together with 764 genotyped 
relatives. Of the patients, 667 were confirmed Type n patients, 35 unconfirmed Type 
II patients, 52 confirmed patients with impaired fasting glucose (TFG) and 9 
unconfirmed patients with EFG. 

15 

Stratification of the Patient Material 

The patients were classified into two sub-phenotypes based on their BMI: 
non-obese Type II diabetes are patients who have BMI less than 30, and obese Type II 
diabetes are patients who have BMI at or above 30. The reason for fractionating the 

20 diabetics into non-obese and obese groups is that other factors may be influencing the 
pathogenesis of disease in these two groups. Obesity alone could be contributing to 
the diabetic phenotype. Therefore, this factor was separated. Obesity is most likely 
due to a combination of environmental and genetic factors. This fractionation into 
non-obese and obese diabetics practically separates the material into two halves; 60% 

25 of the patients are in the non-obese category (20% with BMI below 25 (lean) and 40% 
with BMI between 25-30 (overweight)), and 40% of the patients are in the obese 
category (BMI above 30). 

An affected-only linkage analysis for each of those sub-phenotypes was 
performed, using the same set of families as above, but classifying patients not 

30 belonging to the particular sub-group as having an unknown disease status. Restricted 
to a particular sub-pheiiotype, some families no longer contain a pair of related . 



tNSDOCID: <WO 2004041 193 A2J_> 



WO 2004/041193 - PCT/US2003/034681 

-86- 

patients classified as affecteds and hence do not contribute in the linkage analysis. 
Such families were excluded from the analysis of the particular sub-phenotype. The 
number of patients and families used in the linkage analysis is summarized in Table 1 
below. 

5 Table 1 : The number of patients and families that contribute to the genome- 

wide linkage scan, both when all the patients are used, and when the analysis is 
restricted to obese or non-obese diabetic patients, respectively. 



Table 1 : Phenotype and Patients 



Phenotype 


Total Number 
of Patients 


NO: of families 
contributing to 
the analysis 


NO: of patients 
contributing to 
the analysis 


All diabetics 


763 


227 


763 


Obese 


296 


92 


219 


Non-obese 


467 


154 


413 



10 

Genome wide scan 

A genome wide scan was performed on 772 patients and their relatives. Nine 
patients were excluded due to inheritance errors so the linkage analysis was 

15 performed with 763 patients and 764 relatives. The procedure was as described in 
Gretarsdottir, et aL, Am J Hum Genet., 70(3):593-603 (2002). In short, the DNA was 
genotyped with a framework marker set of 906 microsatellite markers with an average 
resolution of 4cM. Alleles were called automatically with the TrueAllele program 
(Cybergenetics, Co., Pittsburgh, PA), and the program DecodeGT (deCODE genetics, 

20 ehf , Iceland), was used to fractionate according to quality and edit the called 
genotypes (Palsson, B., et al 9 Genome Res., 9(10):1002-1012 (1999)). The 
population allele frequencies for the markers were constructed from a cohort of more 
than 30,000 Icelanders that have participated in genome- wide studies of various 
disease projects at deCODE genetics. Additional markers were genotyped within the 

25 locus on chromosome 5q, where we observed the strongest linkage signal, to increase 
the information on identity by descent (IBD) sharing within the families. For those 
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markers, at least 180 Icelandic controls were genotyped to derive the population allele 
frequencies. 

The additional microsatellite markers that were genotyped within the locus 
were either publicly available or designed at deCODE genetics; those markers are 
5 indicated with a DG designation. Repeats within the DNA sequence were identified 
that allowed us to choose or design primers that were evenly spaced across the locus. 
The identification of the repeats and location with respect to other markers was based 
on the work of the physical mapping team at deCODE genetics. 

For the markers used in the genomewide scan, the genetic positions were 
10 taken from the recently published high-resolution genetic map (HRGM), constructed 
at deCODE genetics (Kong A., et al 9 Nat Genet, 31: 241-247 (2002)). The genetic 
position of the additional markers are either taken from the HRGM, when available, 
or by applying the same genetic mapping methods as were used in constructing the 
HRGM map to the family material genotyped for this particular linkage study. 

15 

Statistical Methods for Linlcage Analysis 

r The linlcage analysis is done using the software Allegro (Gudbjartsson et aL, 
Nat Genet. 25:12-3, (2000)) that determines the statistical significance of excess 
sharing among related patients by applying non-parametric affected-only allele- 

20 sharing methods (without any particular disease inheritance model being specified). 
Allegro, a linkage program developed at deCODE genetics, calculates LOD scores 
based on multipoint calculations. Our baseline linkage analysis uses the S pa irs scoring 
function (Whittemore, A.S. and Halpern, J., Biometrics 50:118-27 (1994); Kruglyak 
L, etaL.Am J Hum Genet 58:1347-63, (1996)), the exponential allele-sharing model 

25 (Kong, A. and Cox, NJ., Am. J. Hum. Genet., 61:1 179 (1997)), and a family 

weighting scheme which is halfway on a log scale between weighting each affected 
pair equally and weighting each family equally. In the analysis, all genotyped 
individuals who are not affected are treated as "unknown". Because of concern with 
small sample behavior, we usually compute corresponding P-values in two different 

30 ways for comparison. The first P -value is computed based on large sample theory; Zj r 
= ^(2 loge (10) LOD) and is approximately distributed as a standard normal 
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distribution under the null hypothesis of no linkage. A second P-value is computed 
by comparing the observed LOD score to its complete data sampling distribution 
under the null hypothesis. When a data set consists of more than a handful of 
families, these two P-values tend to be very similar. 
5 All suggestive loci with LOD scores greater than 2 are followed up with some 

extra markers to increase the information on the EBD-sharing within the families and 
to decrease the chance that a LOD score represents a false-positive linkage. The 
information measure we use was defined by Nicolae (D. L. Nicolae, Thesis, 
University of Chicago (1999)) and is a part of the Allegro program output. This 
1 0 measure is closely related to. a classical measure of information as previously 
described by Dempster et.al. (Dempster, A.P., et al, J. R. Statist. Soc. B, 39:1 
(1977)); the information equals zero if the marker genotypes are completely 
uninformative and equals one if the genotypes determine the exact amount of allele 
sharing by descent among the affected relatives. Using the framework marker set 
15 with average marker spacing of 4 cM typically results in information content of about 
0.7 in the families used in our linkage analysis. Increasing the marker density to one 
marker every centimorgan usually increases the information content above 0.85. 

Results 

20 The results of the genome-wide linkage analysis with the framework marker 

set are shown in FIG. 4 which depicts the aUele-sharing LOD-score versus the genetic 
distance from the p-terminus in centimorgan (cM) for each of the 23 chromosomes. 
The analysis was performed with the three phenotypes: all Type E diabetics (solid 
lines), non-obese diabetics (dashed lines) and obese diabetics (dotted lines). A LOD- 

25 score of 1 .84 is observed on chromosome 5q34-q35.2 with the framework marker set 
when we use all Type H diabetics in the analysis. When the linkage analysis is 

restricted to non-obese diabetes, this LOD-score increases to 2.81. The obese 
diabetics do not show linkage in this region. 

Additional markers were genotyped in this area to increase the information 
30 content and to confirm the linkage. The information on the IBD-sharing at this locus 
was about 78% with the framework marker set. In order to increase the information 



INSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 

-89 

content, another 38 microsatellite markers were genotyped within a 40 cM region that 
includes the observed signal. Repeating the linkage analysis including the additional 
markers increased the LOD-score to 3.64 (P-vahie = 3. 18xl0" 5 ) for the non-obese 
diabetics. For all patients, the peak LOD-score increased to 2.9 (P-value = 1.22X10" 4 ). 
This is shown in FIG. 5. 

The peak of the LOD-score is centered on marker D5S625 and the region 
determined by a drop of one in the LOD is from marker DG5S5 to marker D5S429, 
centromeric and telomeric respectively. The one-LOD-drop is about 9 cM and 
estimated to be about 3.5 Mb. This 1-LOD-drop roughly corresponds to the 80-90% 
confidence interval for the location of a putative disease associated gene. 

Locus-wide association study 

Genotyping to Naii'ow Down the Region of Linkage 

In order to narrow down the region of interest, the linkage analysis is followed 
15 by a comprehensive association study of the 1-LOD-drop. This is necessary as the 
linkage analysis has limited resolution; it compares sharing among closely related 
individuals that share on average large chromosomal segments. For the association 
analysis, we identified a large number of additional microsatellite markers located in 
, the 1-LOD-drop and typed those markers in both our patient cohort and in a large 
20 number of unrelated controls randomly selected from the Icelandic population. 

We identified and typed 67 markers in the 1-LOD-drop in addition to the 17 
markers already typed and used in the linkage analysis (locus-wide association 
micorsatellites; Table 6). The new polymorphic repeats (dinucleotide or trinucleotide 
repeats) were identified with the Sputnik program. We subtracted the smaller allele of 
25 CEPH sample 1 347-02 (CEPH genomics repository) from the alleles of the 

microsatellites and used it as a reference. A total of 84 markers were available for the 
association analysis, i.e., an average density of one marker every 42kb or one marker 
every 0.107 cM. All those markers were typed for 590 non-obese diabetics and 477 
unrelated controls. 

30 
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Statistical Methods for Association and Haplotype Analysis 

For single marker association to the disease, we use Fisher exact test to 
calculate a two-sided P-value for each individual allele. When presenting the results, 
we use allelic frequencies rather than carrier frequencies for microsatellites, SNPs and 
5 haplotypes. Haplotype analyses are performed using a computer program we 
developed at deCODE called NEMO (NEsted MOdels) (Gretarsdottir, et ai y Nat 
Genet 2003 Oct;35(2):13 1-8). We use NEMO both to study marker-marker 
association and to calculate linkage disequilibrium (LD) between markers, and for 
case-control haplotype analysis. With NEMO, haplotype frequencies are estimated by 

10 maximum likelihood and the differences between patients and controls are tested 
using a generalized likelihood ratio test. The maximum likelihood estimates, 
likelihood ratios and P-values are computed with the aid of the EM-algorithm directly 
for the observed data, and hence the loss of information due to the uncertainty with 
phase and missiiig genotypes is automatically captured by the likelihood ratios, and 

15 under most situations, large sample theory can be used to reliably determine statistical 
significance. The relative risk (RR) of an allele or a haplotype, i.e., the risk of an 
allele compared to all other alleles of the same marker, is calculated assuming the 
multiplicative model (Terwilliger, J.D. & Qtt, J. A haplotype-based haplotype relative 
risk' approach to detecting allelic associations. Hum Hered 42, 337-46 (1992) and 

20 Falk, C.T. & Rubinstein, P. Haplotype relative risks: an easy reliable way to construct 
a proper control sample for risk calculations. Ann Hum Genet 5 1 ( Pt 3), 227-33 
(1 987)), together with the population attributable risk (PAR). 

In the haplotype analysis, it may be useful to group haplotypes together and 
test the group as a whole for association to the disease. This is possible to do with 

25 NEMO. A model is defined by a partition of the set of all possible haplotypes, where 
haplotypes in the same group are assumed to confer the same risk while haplotypes in 
different groups can confer different risks. A null hypothesis and an alternative 
hypothesis are said to be nested when the latter corresponds to a finer partition than 
the former. NEMO provides complete flexibility in the partition of the haplotype 

30 space. Li this way, it is possible to test multiple haplotypes jointly for association and 
to test if different at-risk haplotypes confer different risk. As a measure of LD, we use 
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two standard definitions of LD, D' and R 2 (Lewontin, R, Genetics, 49:49-67 (1964) 
and Hill, W.G. and A. Robertson, Tlieor. Appl. Genet., 22:226-231 (1968)) as they 
provide complementary information on the amount of LD. For the purpose of 
estimating D' and R 2 , the frequencies of all two-marker allele combinations are 
5 estimated using maximum likelihood methods and the deviation from linkage 

disequilibrium is evaluated using a likelihood ratio test. The standard definitions of 
D' and R 2 are extended to include microsatellites by averaging over the values for all 
possible allele combinations of the two markers weighted by the marginal allele 
probabilities. 

10 The number of possible haplotypes that can be constructed out of the dense set 

of markers genotyped in the 1-LOD-drop is very large and even though the number of 
haplotypes that are actually observed in the patient and control cohort is much 
smaller, testing all those haplotypes for association to the disease is a formidable task 
Note that we do not restrict our analysis to haplotypes constructed from a set of 
15 consecutive markers, as some markers may be very mutable and might split up an 
otherwise well conserved hapiotype constructed out of surrounding markers. 

The approach We take to the problem of identifying those haplotypes in the 
candidate region that show strongest association to the disease is two-fold. First, we 
restrict the haplotypes we test to span a sub-region small enough that the included 
20 markers may be expected to be in substantial LD. m this study, we only consider 
haplotypes that span less than 300kb. Second, we apply an iterative procedure that 
gradually builds up the most significant haplotypes. Starting with haplotypes 
constructed out of 3 markers, we select those haplotypes that show strong association 
to the disease, add other nearby markers to those haplotypes and repeat the association 
test By iterating this procedure, we expect to identify those haplotypes that show 
strongest association to the disease. 



Results 

For the association analysis, we genotyped 590 non-obese Icelandic Type H 
30 diabetes patients and 477 unrelated population controls using a total of 84 

microsatellite markers. These markers are distributed evenly across a region of 
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approximately 3.5 Mb. The region is centered on our linkage peak and corresponds to 
the 1-LOD-drop. We then applied the procedure described above and looked for 
single-markers and haplotypes consisting of up to 5 markers that showed association 
to the disease. The result is summarized in FIG. 6. In FIG. 6, we show the location of 
5 a marker or a haplotype on the horizontal axis and the corresponding P-value from the 
associaton test on the vertical axis. This is shown for all haplotypes tested that have a 
P-value less than 0.01. The horizontal bars indicated the size of the corresponding 
haplotypes and the location of all markers is shown at the bottom of the figure. All 
locations are in Mb and refer to the NCBI Build33. 

10 We observe a series of correlated haplotypes that show strong association for 

non-obese diabetics in two locations within the 1-LOD-drop. We denote those 
regions A (168.37 - 168.83Mb) and B (169.70 - 170.17Mb), and in Table 10 we list 
the most significant haplotype in each of those regions. For each haplotype, the table 
includes a two-sided single-test P-value for association, calculated using NEMO, the 

15 corresponding relative risk, the estimated frequency of the haplotype in the patient 

and the control cohorts, the region the haplotype spans, and the markers and alleles (in 
bold) that define the haplotype. : / 

Note, however, that some of the haplotypes listed within each of the two 
regions are very correlated and should be considered as a single observation of 

20 association to the disease. This is demonstrated for region B in Table 3, which lists 
the pairwise correlation, both D' and R 2 , between the haplotypes. Based on the 
correlation, we observe that haplotypes B2 and B4 are strongly correlated and should 
be considered as a single observation of association to this region. Likewise, 
haplotypes Bl and B5 are strongly correlated. However, haplotypes Bl, B2 and B3 

25 are all weakly correlated with each other; and in fact, B 1 and B2 are mutually 

exclusive, i.e., never appear jointly on the same chromosome. These three haplotypes 
hence constitute three almost independent observations of association to non-obese 
diabetes of this region within the locus. It is possible to test haplotypes Bl, B2 and 
B3 together as a group for association to non-obese diabetes. This test yields a P- 

30 value = 8.5x1 0" 8 with a corresponding relative risk of 5.2, a population attributable 
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risk of 13.9%, and an allelic frequency of 0.089 and 0.018 in the patient and the 
control cohorts, respectively. 

Table 2 





P-value 


RR 


Aff.fr 


Ctri.fr 
a 

M 


Span 
(Mb) 


Haplotype 


Al 


0.000005 


> 10" 


0.033 


0.000 


168.37 
168.72 


0 DG5S879 4 DG5S881 -4 D5S2075 
0 DG5S883 4 DG5S38 


A2 


0 O00006 


3.81 


0.053 


0.015 


168.55- 
168.77 


4 DG5S1058 -6 DG5S37 


A3 


0 000008 

w > KJ \J \J\J \S \J 


3.64 


0.054 


0.015 


168.55- 
168.83 


4 DG5S1058 -6 DG5S37 0 
DG5S101 


A4 


0 000015 


6.18 


0.046 


0.008 


168.40- 
168.72 


4 DG5S881 4 DG5S1058 -4 
D5S2075 0 DG5S883 4 DG5S38 


A5 


0 000015 


4.42 


0.047 


0.011 


168.37- 
168.77 


0 DG5S879 4 DG5S1 058 -6 
DG5S37 


A6 


0.000018 


6.94 


0.045 


0.007 


168.40- 
168,72 


4 DG5S881 -4 D5S2075 Q DG5S883 
4DG5S38 


Bl 


0.000011 


> 10 


0.039 


0.000 


169.87- 
170,17 


0 DG5S953 0 DG5S955 0 DG5S13 5 
DG5S959 ■ 


B2 


0.000023 


> 10 


0.034 


0.000 . 


169.65- 
169.87 


27 DG5S888 0 DG5S953 


B3 


0.000023 


5.26 


0.049 


0.010 


169.87- 
170.04 


0 DG5S953 0 DG5S955 4 QG5S124 


B4 


0.000031 


> 10 


0.034 


0.000 


169.65- 
169.87 


27 DG5S888 0 DG5S44 0 DG5S953 


B5 


0.000060 


> 10 


0.034 


0.000 


169.87- 
170.17 


0 DG5S953 0 DG5S955 0 DG5S13 0 
DG5S123 5DG5S959 



5 

Table 2: Haplo types within the 1-LOD-drop that show the strongest association to 
non-obese diabetes. For each haplotype, we show (/) a two-sided P-value for a single 
test of association to non-obese diabetes, (ii) the corresponding relative risk (RR), (iii) 
the estimated allehc jfrequency of the haplotype in the patient and the control cohort, 
10 (fv) the span of the haplotype (refering to NCBI 33) and (v) the alleles (in bold) and 
markers that define the haplotype. The haplotypes are separated into two groups, A 
and B, corresponding to two different regions within the 1-LOD-drop. 
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Table 3 





Bl 


D' 

B2 


B3 


B4 


B5 


Bl 




0 


0 


0 


1 


R2 B2 


0 




0.4 


1 


0 


B3 


0 


o.i 




0.35 


0 


B4 


0 


0.96 


0.7 




0 


B5 


0.92 


0 


0 


0 





Table 3: Pairwise correlation between the five haplotypes in the B-region that show the 
5 strongest association to non-obese diabetes. Estimates of D' are shown in the upper right 
comer, and estimates of R 2 are shown the the lower left corner. The haplotypes are labelled 
Bl, ...,B5 as in Table 2. 

Investigation of Region B 

10 Genes in Region B 

We next identified all genes in and around region B (UCSC). In the region 
defined by the five most significant haplotypes, 1 69.70 - 170. 1 7 Mb, there are four 
genes, %CP2 (lymphocyte cytosolic protein 2), KCNMB1 (potassium large 
conductance calcium-activated channel, subfamily M, beta member 1), KChlPlQLv 

15 channel interacting protein 1) and GABRP (gamm a-aminobutyric acid (GAB A) A 
receptor, pi). Of those genes, KChlPl is by far the largest^ stretching from 169.7 to 
170.1 MB, or almost the entire span of the observed haplotype association. The other 
three genes are small. In addition, there is a big gene, RANBP1 7 (RAN binding 
protein 17), just telomeric of the location of the observed association signal. The 

20 relative location of all the genes is shown in FIG, 7, which shows the location of the 
exons of KCHIP1 as solid bars, and the location of the other genes as shaded boxes. 
In addition, FIG. 7 shows the location of the micrdsatellites (filled boxes) that we 
have typed in this region and the location of the at-risk haplotypes Bl, ...,B5 (gray 
horizontal lines). 

25 
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Description of new Splice Variants of KCMP1 Identified by RACE and PCR 

The published sequence for KChlPl comprises exons 1 to 8. New exons 
belonging to the KChIP 1 gene and four different splice variants were discovered by 
performing RACE or PCR (primers within the exons) using as template human 
5 Marathon cDNA and cDNA prepared from rat pancreatic INS1 beta cells. In all, 6 
new exons located in the 5 ' region of the gene were discovered.. An alternative exon 
1 was found that we call exon la. Here, we label the published sequence for exon 1 
with a "b" to distinguish it from the alternative exon 1, exon la. Four exons are 
called UTR 1, UTR 2, UTR 3 and UTR 4, or untranslated region 1 - 4, because they 

10 lie upstream of exon lb and they are not translated. The last exon to be identified is 
called Lis-r, or insert rodent, because it was known to be present in mouse and rat, and 
has recently been demonstrated by others to be present in humans as well (Bolaiid et 
dl, Am J Physiol Cell Physiol 285, C161-170. (2003)). See nucleotide sequences of 
the new exons below, as well as their location in the genomic sequence of NCBI build 

15 33. Even if not mentioned, all new variants of KChlPl found and described below 
include exons 2 - 8 of the published sequence. 

Splice variant 1 consists of exon la, UTR1, UTR2, UTR3, UTR4 and exon lb. 
Exon la is untranslated and the resulting protein is identical in amino acid sequence to 
KChlPl described by An et dl (Nature 430, 553-556 (2000), see also FIG.2). This 

20 variant was observed in human heart and testis and the fat INS 1 cell line. 

Splice variant 2 consists of exon lb and the Ins-r exon giving rise to a protein 
that is identical in amino acid sequence to KChIP 1 described by Boland et ah . This 
variant was observed in human brain* heart, pancreas and the rat INS 1 cell line. 

Splice variant 3 consists of exon la and is identical in nucleotide sequence to 

25 AL538404, an EST in NCBI. The amino acid sequence of the N-terminus coded by . 
exon la is unique (see sequence below) but the amino acid sequence coded by exons 
2 - 8 is that of the published sequence. This variant was observed in human brain, 
heart, pancreas, skeletal muscle* adipose tissue, liver, hypothalamus, small intestine, 
testis and the rat INS 1 cell line. 
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Splice variant 4 consists of exons 1 a and UTR1 , which would result in a 
protein translated from exons 2 - S. The second methionine in exon 2 has a Kozak 
. sequence. This variant was observed in human heart. 

The nucleotide sequences of the new exons are as follows (the genomic 
5 locations given are from NCBI build 33, see also Table 8): 



Exon la: 169716298 - 169716511 (Build 33) 

GGCTTCAGGGGTGCATCCGTCACTCAGGGTTCATTCACCCAGGCAGGCTCCAAGT 
TCCTGGGGTGCACAAGGTGGGCACTGTCCCTTCTGGGTGCTGACAGCAGAGCCTG 
1 0 GCTCCCCTCCGCCACCATGAGCGGCTGCTCCAAAAGATGCAAGCTTGGGTTCGTG 
AAATTTGCCCAGACCATCTTTAAGCTCATCACrGGGACCCTCAGCAAA^ (SEQ ID 
NO: 4) 

UTR1: 169848417-169848523 (Build 33) 
1 5 ACTCAGCATCATCAAGACTGGAGGGACAGAGCATTTGAATCATCAGACGCTGGGC 
CAGACGTCACCCCACGCGTTTTCTCATTTTATC GTCCTAAGAAGCCCAGAAG (SEQ 
ID NO: 5) 

UTR2: 169861083-169861 154 (Build 33) 
20 CCTGAATGCAATTTGCAATGAGGAGATGATTTGATTTTCTT 
AGCTTCCTGAGAGCAG(SEQ ID NO: 6) 

UTR 3: 169864589-169864679 (Build 33) 

GGGTTCCCCAGGAGACCACGACAGAGGCGTGGAACCCAAGTTCTAATCCCACATC 
25 CTGGCTGGGCAACITCAGGCAAATTTCtAACACAAG (SEQ ID NO: 7) 

UTR 4: 169867066-169867173 (Build 33) 

GGTAGGGGAGGGGCCGGGCCCGGGGTCCCAACTCGCACTCAAGTCTTCGCTGCCA 
TGGGGGCCGtCATGGGCACCTTCTCATCTCTGCaAACCAAACAAAGGCGACCC 
30 (SEQ ID NO: 8) 

Ins-r 170075401-170075433 

ACATCGCCTGGTGGTATTACCAGTATCAGAGAG (SEQ ID NO: 9) 

35 The nucleotide sequence derived from splice variant 4 (KChlPl .4) with the 

ATG and a Kozak sequence ((G/ANNATGG) underlined is as follows: 

ATAAGATTGAAGATGAGCTGGAGATG ACCATQGT ITGCCATCGGCCCGAGGGACT 
GGAGCAGCTCGAGGCCCAGACCAACTTCACCAAGAGGGAGCTGCAGGTCCTTTAT 

40 CGAGGCTTCAAAAATGAGTGCCCCAGTGGTGTGGTCAACGAAGACACATTCAAGC 
AGATCTATGCTCAGTTTTTCCCTCATGGAGATGCCAGCACGTATGCCCATTACCTC 
TTCAATGCCTTCGACACCACTCAGACAGGCTCCGTGAAGTTCGAGGACTTT 
CGCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTT 
AATTTGTATGACATCAACAAGGACGGATACATAAACAAAGAGGAGATGATGGAC 

45 ATTGTCAAAGCCATCTATGACATGATGGGGAAATACACATATCCTGTGCTCAAAG 
AGGACACTCCAAGGCAGCATGTGGACGTCTTCTTCCAGAAAATGGACAAAAATAA 
AGATGGCATCGTAACnTTAGATGAATTTCITGAATCATGTCAGGAGGACGACAAC 
ATCATGAGGTCTCTCCAGCTGTTTCAAAATGTCATGTAACTGGTGACACTCAGCCA 
TTCAGCTCTCAGAGACATTGTACTAAACAACCACCTTAACACCCTGATCTGCCCTT 

50 GTTCTGATTTTACACACCAACTCTTGGGACAGAAACACCTTTTACACT^ 
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AITCTCTGCTGAAGACrrTTCTATGGAACCCAGGATCATGTGG 
GCCAACTCTTCCY CTTTCTTCTTCTTG AGAGAG A (SEQ ID NO: 10) 

5 The protein sequences resulting from the splice variants are as follows: 

KChIPl.3 

(The amino acid sequence derived from splice variant 3 (KChIPl.3), the underlined amino 
acids are coded by exon la.) 
10 MSGCSKRCKLGFVKFAOTIF^ 

KRELQVLYRGFKNECTSGVVNEDTFKQIYAQFFPHGD^ 
KI^DFWAI^IIXRGTVHEKLJIWTF^ 

PVLKEDTPRQHVDVITQKM^^ (SEQ 
ID NO: 11) 

15 

KChIPl.2 

(The amino acid sequence derived from splice variant 2 (KCMP1.2), the underlined amino 
acids are coded by exon Ins-r.) 

MnAVMGTFSSLQTKOKRPS KDIAWWYYOYOR^ 
20 QTNFTKRELQVLYRGFKKECTSGVVOT 
QTGSVKFEDFVTALSDXRGTV^ 
GKYTYPVLKEDTTRQHVDVFFQ 
M (SEQ ED NO: 12) 

25 KChIP1.4 

(The axnino acid sequence derived from splice variant 4 (KChIP1.4).) 

MVCmPEGLEQLEAQTNFTKREL 

STYAmn^FNAFDTTQTGSVKFEDFVTAI^ 

EMMDIVKAIYDMMGKYTYPV^ 
30 DDNIMRSLQLFQNVM (SEQ ID NO: 13) 



Identification of SNPs and Microsatelliies 

In order to identify SNPs across KChlPl, all exons of KChJPl and their 
35 flanking regions were sequenced on 94 non-obese diabetic patients. As a 

consequence, 31 SNPs were identified (Table 9). Additional SNPs were identified 
across the gene by selecting SNPs from the public domain (US National Center for 
Biotechnology Information's SNP database) and designing SNP assays for them. 
(Table 10). 

40 We genotyped SNPs on 470 non-obese diabetics and 658 population-based 

controls using a method for detecting SNPs with fluorescent polarization template- 
directed dye-terminator incorporation (SNP-FP-TDI assay) (Chen, X., Zehnbauer, B., 
Gnirke, A. & Kwok, P.Y. Proc. Natl Acad. Set USA 94, 10756-10761 (1997)). 



45 
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Association Study of Genes in Region B 

We tested all the genes in and around Region B (LCP2, KCNMB1, KChlPl, 
GABRP and RANBP1 7) individually for association to non-obese diabetes. In the 
analysis of each gene, we included all SNPs identified, and previously typed 
5 microsatellites, in and close to that gene. The association analysis was carried out in 
the same way as the locus-wide association, i.e., using the iterative approach, we 
search for haplotypes, shorter than 300kb, that showed strongest association to the 
disease. 

The strongest association observed was for KChLPL For KChlPl, we tested 
10 25 markers, 7 microsatellites and 1 8 SNPs, for association (Table 1 1). The strongest 
association signal was observed in the 3'-end of the gene; a three marker haplotype 
with a P-value = 9.2xl0" 5 , relative risk 12, and aUelic frequency 3.6% and 0.3% in the 
patient and control cohorts, respectively. This haplotype, which extends over the last 
8 oxons of KChlPl, from 169.96 to 170.11Mb, is listed in Table 4 as Dl. We also 
15 observed another haplotype in the same region that showed association to non-obese 
diabetes, albeit less significant than Dl, with a P-value = 0.037, relative risk 1.69 and 
allelic frequency 7.8% and 4.8% in the patient and the control cohorts, respectively. 
This haplotype is labelled D2 in Table 4. For risk haplotypes, the corresponding 
population attributable risk is PAR = 4.9% for D 1 and PAR = 4.7% for D2. However, 
20 as Dl and D2 are independent haplotypes, i.e., they do not appear jointly on the same 
chromosome, their population attributable risk can be added together. 



Table 4 



Icelandic 


P-Value 


RR 


Aff.frq. 


Ctrl.frq 


Haplotype 


Dl 
D2 


9.20E-05 
0.037 


12 
1.69 


0.036 
0.078 


0.003 
0.048 


-4 DG5S13 C KCP 1152 
0 D5S625 

0 DG5S124 C KCP 1152 C 
KCP 2649 T KCP 4976 A 
KCP 16152 


Danish 












Dl 
D2 

* /-V ! _J 1 


0.052* 
0.002* 


2.98 
2.74 


. 0.031 
0.098 


0.011 
0.038 


-4 DG5S13 C KCP 1152 0 
D5S625 

0 DG5S124 C KCP 1152 C 
KCP 2649 T KCP 4976 A 
KCP 16152 



* One-sided P-value 
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Table 4: Microsatellite and SNP haplotype association within KChIPL The two 
independent haplotypes Dl and D2 are located in the 3 f -end of the gene, from 169.96 - 170.1 1 
Mb. Shown are results of a test of association for non-obese diabetics vs population controls 
for both haplotypes in a cohort of Icelandic diabetics (top) and a replication in a cohort of 
5 Danish diabetics (bottom). Note that we report one-sided P-values for the test on the Danish 
cohort as that is a replication of association results previously observed in the Icelandic 
cohort. 

Replication in a Cohort of Danish Diabetics 

10 We typed the markers that define the two at-risk haplotypes, Dl and D2, in a 

cohort of 1 49 non-obese Danish females that have been diagnosed with diabetes 
and/or measured >7rnM glucose who participated in a Danish PERF (Prospective 
Epidemiological Risk Factors) study. As controls, we used 346 females from the 
same study that answered no to a question about their diabetes status and/or measured 

15 <7mM glucose. 

The results of the association test for the two at-risk haplotypes, identified in 
the Icelandic diabetes cohort, are listed in Table 4. Both haplotypes appear in higjier 
frequency in the non-obese Danish diabetics than in the control cohort. For haplotype 
Dl, the association to non-obese diabetes is only marginally significant, with a one-* 

20 sided P-value = 0.05, and th£ relative risk of the at-risk haplotype is RR = 3.0, 

somewhat less than is observed for the Icelandic noii-obese diabetics, Note, however, 
that the estimated frequency of haplotype Di is very low, especially in the control 
cohorts, hence the estimates of the relative risk are not very reliable. For haplotype 
D2, on the other hand, we do observe a statistically significant association with a one- 

25 sided P-value = 0.002 and relative risk = 2.74. Note that as the test of association of 
haplotypes Dl and D2 are attempts to replicate the association we have observed for 
Icelandic non-obese diabetics, it is appropriate to report one-sided P-values for those 
tests. 

30 Additional SNP Genotyping for KCMP1 

Having observed association to the 3'-end of KChlPl, both in Icelandic and 
Danish non-obese diabetics, we subsequently sequenced 94 Icelandic individuals, 1/3 
non-obese type E diabetes patients with the observed haplotype Dl, 1/3 additional 
non-obese type II diabetes patients and 1/3 controls. The purpose of the sequencing 
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was to identify additional SNPs. We identified 725 SNPs (Table 12). Many of those 
SNPs were completely correlated so we removed several redundant SNPs from 
further genotyping. Some SNPs with very low minor allele frequencies were also 
ignored. Of the 725 identified SNPs plus what was originally identified, 108 were 
5 selected for further genotyping in the Icelandic cohort (Table 13). 

We performed a single-marker test of association to non-obese diabetes for 
each of the additional SNPs we typed, although none of the SNPs showed a strong 
association. We did, however, observe that three of the SNPs, KCP_197678, 
KCP_1 97775 and KCPJ202795, increased the specificity of haplotype D2, if added to 

10 that haplotype, while still retaining most of its sensitivity. This is shown in Table 5, 
both for the association in the Icelandic and in the Danish cohorts. This increases the 
value of the at-risk haplotype as a diagnostic tool. Note that the three SNPs are very 
correlated to each other, with pairwise correlation coefficients D' « 0.96 and R 2 « 0.9, 
hence the association of haplotypes D3, D4 and D5 to non-obese diabetes should be 

15 considered as a single observation. 

In addition to the refinement of the at-risk haplotype D2, we observed another 
refinement of the at-risk haplotype, consisting of three SNPs only, that was very 
correlated with the three at-risk hapldtoypes, D3, D4 and D5, with pairwise 
correlation coefficients D' « 0.83 and R 2 « 0.59, This haplotype is included in Table 

20 5 as D6. 
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Table 5 





P- 


RR 


PAR 


Aff.frq. 


Ctrl.f 


Haplotype 




Value 












Icelandic 














D2 












0 DG5S124 C KCP_1 152 C KCP_2649 T 




0.037 


1.69 


6.3% 


0.078 


0.048 


KCP_4976 A KCP_16152 


D3 












0DG5S124CKCP 1152CKCP 2649 T 




0.022 


2.19 


5.5% 


0.052 


0.024 


KCP 4976 A KCP 1 6 1 52 T KCP" 1 97678 


D4 












0 DG5S1 24 C KCP 1 152 C KCP ^649 T 




0.052 


2.03 


4.6% 


0.046 


0.023 


KCP 4976 AKCP 16152 TKCP 197775 


D5 












0 DG5S 1 24 C KCP 1 152 C KCP "2649 T 




0.023 


2.14 


5.5% 


0.052 


0.025 


KCP 4976 A KCP 1 6152 C KCV 202795 


D6 




1.77 








AKCP 173982 C KCP 15400 C 




0.054 


4.0% 


0.046 


0.027 


KCP 18069 


Danish 














D2 












0 DG5S124 CKCP 1 152 C KCP 2649 T 




0.002* 


2.74 


12.0% 


0.098 


0.038 


KCP_4976 A KCP^l 6152 


D3 


0.0046 










0 DG5S124 C KCP 1 152 C KCP 2649 T 




* 


2.60 


9.0% 


0.076 


0.030 


KCP_4976AKCP 16152 TKCP 197678 


D4 


0.0004 










0 DG5S124 C KCP_1 152 C KCP 2649 T 




* 


3.69 


11.3% 


0.078 


0.023 


KCP_4976AKCP 16152 TKCP 197775 


D5 


0.0002 










0 DG5S124 CKCP 1152 CKCP 2649 T 




* 


3.67 


11.7% 


0.084 


0.024 


KCP 4976 A KCP 16152 C KCP 202795 



* One-sided P-value 



Table 5: Microsatellite and SNP haplotype association within KChlPl. Shown is association 
of the at-risk haplotype D2, and of further refinements of that haplotype; haplotypes D3, D4 - 
5 and D5, to non-obese diabetes. This is shown both for the Icelandic and the Danish cohorts 
and, as in Table 4, we report one-sided P-values for the association test in the Danish cohort. 
Finally, we include the result of association to non-obese diabetes, in the Icelandic cohort, of 
a 3 SNP haplotype, D6, that is strongly correlated with the at-risk haplotoypes D3, D4 and 
D5. 

10 

Allele Numbering System 

SNP alleles are indicated by the letters found in the DNA sequence. In 
general the alleles can be references by A=0, C=l, G=2 and T=3. For microsatellite 
alleles, the CEPH sample (Centre d'Etudes du Polymorphisme Humain, genomics 
15 repository) is used as a reference, the lower allele of each microsatellite in this sample 
is set at 0 and all other alleles in other samples are numbered according in relation to 
this reference. Thus allele 1 is 1 bp longer than the lower allele in the CEPH sample, 
allele 2 is 2 bp longer than the lower allele in the CEPH sample, allele 3 is 3 bp longer 
than the lower allele in the CEPH sample, allele 4 is 4 bp longer than the lower allele 
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in the CEPH sample, allele -1 is 1 bp shorter than the lower allele in the CEPH 
sample, allele -2 is 2 bp shorter than the lower allele in the CEPH sample, and so on. 

. Table 6: 

5 The DNA sequence of the microsatellites employed for the C05 locus wide 
association (including Build 33 locations). 

Y= C or T; S = C or G; R = A or G; W = A or T; M = A or C; K = G or T. 



TABLE 6 



Name 


Position 


Nucleic Acid Sequence 


SEQID 
NO: 


DG5S5 


167638990 
167639163 


TCCTCAGAACAGGTGCAACACAGTGTGTTTTGCTGGGG 
AAAAGGGATGTCAAGCAATCTATGACGGGGGTGCAGG 
GAGTCTGGGGAGAAACACAAGGAAGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGAATGTGTGTGTGTGTGAGAG 
AGAGAGCTGGTGTTTGTGTTCCA 


SEQID 
NO: 14 


D5S671 


167657904 
167658237 


GGAATGTGCCAAGACATTCTTTAGGGTTGGTAACCAG 

AGACGCTATTTTGTCCTTGGTGGCTAAGAAATCACTTT 

TCTGACTGAAGGNCCATTTGACITACTTCTTT^ 

CAGGGGAATGGGTGGGCATCTCCATGATTCAGGTAAG 

GAAAAATCCAAGGNAAATAAACACACACACACACAC 

ACACACACACACACACACACGGAGTAGAAATTTTTAG 

TGCAATTTTTTGTCTCACAGCATTAATTAATTGCAGGG 
ATATAACTACCTTGOPAn A ATTTTTTrrrrrr a A-nnnA 

CCACGCCCCGGAATAAGTTTGGCTCTTTTCAGCT 


SEQID 
NO: 15 


DG5S870 


167719773 
167719939 


TGCCCACTCATAAGATGCTGAGGTTACAACTGTTAATA 

AGATATTAAGATACTGTCTTTTTCTTCCT 

TACACACACACACACACACACACACACACACTTTTTG 

GGCCAACTGGAAATTCATACATTCTCCCCAGCACTGGA 

GCTCAAAGCGTCTG 


SEQID 
NO: 16 


D5S67I 


167657904 
167658237 


GGAATGTGCCAAGACATTCTTTAGGGTTGGTAACCAG 

AGACGCTATTTTGTCCTTGGTGGCTAAGAAATCACTTT 

TCTGACTGAAGGNCCATTTGACITACnTCTTTTAAA^ 

CAGGGGAATGGGTGGGCATCTCCATGATTCAGGTAAG 

GAAAAATCCAAGGNAAATAAACACACACACACACAC 

ACACAC ACAC ACACAC ACACGGAGTAGAAATTTTTAG 

TGCAATTTTTTGTCTCACAGCATTAATTAATTGCAGGG 

ATATAACTACCTTGGCAGAATTTTTTCTCCCCAACCCA 

CCACCCCCCGGAATAAGTTTGGCTCTTTTCAGCT 


SEQID 
NO: 17 


DG5S870 


167719773 
167719939 


TGCCCACTCATAAGATGCTGAGGTTACAACTGTTAATA 

AGATATTAAGATACTGTCTTTTTCTTCCTCTT^ 

TACACACACACACACACACACACACACACACTTTTTG 

GGCCAACTGGAAATTCATACATTCTCCCCAGCACTGGA 

GCTCAAAGCGTCTG 


SEQID 
NO: 18 



MSDOCID: <WO 2004041 193A2_I_> 



WO 2004/041193 



-103- 



PCIYUS2003/034681 



DG5S85 . 


167721558 
167721918 


TTGTTGTTGTTGGTGGTGGTGGGGTGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTTCGAGACAGACTCTC 

ACTCTGTCACCCAGGCTGGAGTGCAGTGGCACGATCT 

GGGTTCACTGCAACCTCTACTTCCTCAGCTCCAAGGAT 

CCTCTCACCTCCACCTCCCAAGTAGCTGGGACTACAGG 

TACGCGCCACCATGTCTGGCTAATTTTTTTGTATTGGA 

GAGACAGGGTTCCACCATGTTGCCCGGGCTAGTGTTGC 

ACTCCTGAGCTCAGGTGATCCACCCACCTCAACGTCCC 

CAAGTGCTGGGATTAGAGGCGTGAGCCAGCACGTCTG 

GCCTATACACTATAGAGTTT 


SEQID 
NO: 19 


DG5S90 


167766290 
167766502 


TCTGGACAGGACCAGGAGTTGGCTGCTGTCAGCCTTTG 

CCCCACCTCTCTGTGGCTACTGGGTATGTGAATCTCTC 

AAGGCCTGAAGAGAGGACAGCTGAGGAATTTGGAAAT 

CCTAAAACACATGCATACACACACACACACACACACA 

CACACACACACACACACACTTTTCTTTCCCTTAAAAAA 

AAAAAGATTCATTCACCGTGTGCA 


SEQID 
NO: 20 


DG5S874 


167846718 
167847065 


CTGTCTACACTACCCACCCATTAGTCACTTATTAGCCC 

TCTGAATTACTGGATTGAAAAAACATAGTATATATATA 

GGGCTTGGTACTATTCACGGTTTCAGGCATCCACTGAG 

GGGTGTTGCAATGTATCTCCCACGGATAAGGAAGGAC 

TGGTATATTAACACnTTTATTTGATTTACAAAATAAAG 

GATAGTTTATATAGTTCTGGGTAAAATTAATTAATTAA 

TTTAAAAGGAAAAAAGATAAAGGCAAACTTTAAGC^ 

GTTAAAAATTAAGTAAAATAATTTGGATTATTTAATTG 

GACAAAGAGGACTGGCTTTGCCAATGAAACAATATGG 

CCGACATG 


SEQID 
NO: 21 


DG5S88 


167864864 
167865059 


GGACc 1 i CITTCTGCCCTAAAACCGCAATATCATTATA 

ATAACAAATATATATATATATATATATATATTTTTTTTT 

TTTAAAACAATCTTGCTATGTTGCCTAGGCTGGTGTGG 

AACTCCTGGCCTCAAGTGATCCTCCCACCTCGGCCTCC 

CAGAGTGCTGGGATTATAGACATGAACTACCATACCC 
AGCCA 


SEQID 
NO: 22 


DG5S7 


167910343 
167910651 


CACAGCCATCAAGTTTCCAACTTACTGCCTCACATATT 

AAGATGAii iiiii'AAACAAACTTAACAGGCGATGGAT 

ACTCCATTCTCCATGATGTGCTTAATTCACATGCATGC 

TTGTATCAAAACATCTCACATACTCCATAAAGCCTGTA 

ATCCCAACACTTTGGGATGCCAAGGTGGGTGGATCAC 

TTGAGCCCAGGAGTTTGAGAACAGCCTGGACAACATG 

GCGAAACCCCATTTACACACACACACACACACACACA 

CACACACACCACACAAACAAAATGAAACAAACACCTA 

ACCAACAA 


SEQID 
NO: 23 


DG5S6 


167952553 
167952858 


TCCTAACGGCTGCTACCACTAAAGATCTTAGCATGGTG 

TGTGTGTGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

TGTGTGGTGGGGGTATTCAGTAAGGCTAGAAGTGAAA 

AAGCTAGTAGAAAGCCCATGGTGATGGAGAATGGAGG 

AAGACTGATTAGGGAGCTCCTCAGCAGTATAAGGAAG 

GACTAAGAGCACATAAGGACAGGATCATAGAATTCCG 

CATCTCAGGATTTTTGAGGCTGCCACTGCCTTAGCTGT 

GAGGCCAGTGCATATAAGAATA OTnTTr^ apa nTrrrn 

CTGTGG 


SEQID 
NO: 24 


DG5S87 


167992779 
167993149 


CCTCTGGGATTAGCCTCTCAGGGTACAGATATAACGAT 

GATTGAGTTGGCTTATGTATGTGTGTGTTGTGTGTGTG 

TCTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGA 

GAGAGAGTGACAGAGAGAGAATGAGAGAGAACTGGA 

AGTTGTCAACAAGAAGAGTCAAACTCTGTAAAATATT 

TGAAGAGATTTATTCTGAGCCAAATAGGAGTGCCACA 


SEQID 
NO: 25 
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GCCCCGGGAGATCCTAAGAACATGTGCCCAGAGTAGT 
CAAGCTATAGTTTGGTTTTATACATTTTAGGGAGACAT 
AAGACATCAGTCAATACATGTAAGATGCACATTGATA 
CACTGGTTTAGTAGGGAAAGGTGGGACAACTCGAA 




DG5S9I 


168014827 
168015078 


GGTGCCAATTAAATCCAACAAGGTAGCTGAGTGTGGT 

GGTGCACGCCTGTAGTCCTAGCTATGCAGGAGGCTGA 

GGTGGGAGGATCACTTGAGCCTGGGAGGTCGAGGCTG 

CTGTGAGCTGTGATTGCACCGCTGCATTCCAGCCTGGG 

AGACAGAGCAAGATCCTGACACACACACACACACACA 

CACACACACACACACACACACACACATTCCAACAAGG 

TAATGTGTAGGAGGAAGTACCCGAGCTT 


SEQED 
NO: 26 


DG5S92 


168065529 
168065864 


CAACTCCTGCAGCCCTTTACGCCAAGCACAGAAATCC 
AGGAGGC! AG A GPPT A GPGPTTri A Tf^ AC a TnrzT a atth 

GGCCTGGAAGTGGGGATTTCTGTCACTTACCTCTCCTT 
GAAAAATAATCACTATTGCCAACGCCTGGTTAATTAGC 
CTGATTCAATTCTCrrCAGCCTCATTTTGCTCAAATCT 
CCAGATTTGTGGTGCTCCTTGGTCCTCCACCACACTTT 

CTACCCCTCATCCCACTTTGTGTGTGTGTGTGTGTGTf^T 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGACA 
TGGCCAGGAATCCTGACTGGCTTCCTTTAAA 


SEQID 

"MTV T7 


DG5S491 


168081175 
168081342 


AAGACCACCCTCCTGTTGTGCTCTCCTGAAATGTATTC 

ATATCCACCCATACACACACACACACACACACACACA 

CACACACACACACACACACACACACACATTCTCTCTCT 

CTCTCTCTTTCTCTCTCT^ 

TCCTGCT1TCCAGA 


SEQID 
NO: 28 


DG5S9 


168139425. 
168139680 


CTTGACATTCAGGGCCTTCTGAGTACATCATCTTGTCA 
AGAAACACTGAACTATTCAGTACACAACAGGTCAGAG 
GTGCCCATTTGATAGCCTGAGGATGGAATCCTTATTGC 
AGCATTTTGCGTCATGCCACAT AT ATGTGT F FTTTPA AT 

CCTCCTCTGTTTTAAAAATTGGAAAATTTCATACAACA 
CACACACACACACACACACACACACAGACACACACAC 
ACCCCCCATACCACACCACACCACATCA 


SEQID 
NO: 29 


DG5S876 


168266982 
368267134 


AGCCTCTGACTCTCCTCTGTGGGGCTAATCCAGAAAAT 
CTTACl'l'lAGAAATAACAATAATAATAATAATAATAAT 
AATAATACCTCATTCATCTxTACTTATCATGTGCTAGT 
ATGTTTCTAAGCCITTTGGCATAGrGTTr A A TnTrrrr 


SEQID 
NO: 30 


DG5S97 


168286866 
168287096 


CTCTTCCCATGTCCTGCTTCTCTCCTTCCTGCGGGTTGG 

GACCCAGTACCCTCCATCTCTCTCACTCCTCCCCTCTCA 

AACCCTTCinrTTAGGAAAGGAGTCCAAATCGACCACTT 

ACACCTCAGTTCAATGCAAGCCAGTATAATTAATAAG 

GAACATTTAAGGGTGTGTAAGGGTGTGTGTGTGTGTAT 

GTGTGTGTGTGTGTGTGTGTAGCTCACTCTGCCTCTGC 

C - 


SEQID 
NO: 31 


D5S2052 


168324273 
168324633 

- • 


GATCACCAGGGAATCTAGATGGAATCCATAGTNCTNC 

CCTGCAAGAATGCTGCAATTCTGTACCGTGGAGGNGC 

CAACAGAATCACCAGGCTCTGTGACTCAGTCACAACA 

CCCTGACCTGCCCCTGTCCATTCTCCATATCATACCCA 

GAGTGGTCTTTTCAAAGCACAGCTTTGACCAATTCTCT . 

GTC IT rCACACATACACACACACACACACACACACAC 

ACATGCGTGCATGCATGCCTGAAATAGTATAGTATTGC 

TCTTAAGATAAACATTAANGTTCCTACCATGGTACAGA 

AAATATATGTNGTTAGGCCCCGTGGCTCT1TCTTTTCC 

AGACTCCTCTTACCCTfTTGTG 


SEQID 
NO: 32 


DG5SS79 


168369069 
168369202 


AAATCll l CCATTOCAGACCAATTAAAATTTAAAGATTT 
TCTCTCTCnTCCCCCT 

CTCTCTCACACACACACACACACACACACACAACTCTC 


SEQID 
NO: 33 
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UAAGUA.CAAAGAGCTGA 




DG5S880 


CHR5: 
168376530 

168376775 


TGAGTGGATGAGGGAGAAGGATAAAAGTCATAAAATG 

CCTCGCAAAAACl'1'iGAGGTCTGCCTGCCTGGTATTAC 

AGAGAAGCTTGCACAATACAGAATGTTTTGTGGGAAG 

GAAGGCAGGCAGGCAGGCAGGCAGGCAGGCAGGTTG 

GTTATGTTTTCACTCTTGATATCTCAAAGCTTTATGACA 

CACTCATGGAGTGAACATAATCTTTGTGGCATGATACA 

AAGGGACTGAATCACTCAAG 


SEQID 
NO: 34 


D5S400 


168378412 
168378696 


AGCTATGATCATGCCACTGCACTCCAGCCTGGCTGATA 

GAATGAGATTCTGACTCAGAAAAATATAAACACACAC 

ACACACACACACACACACACACACACACACACACACA 

CACAACTTTCTGTCTGCCCCCTTGCTCTTCCTGTCCCAT 

CTCTGCCTTTCTTCTTTCCTCrCTTTGTCAAATCTCCTTC 

GTCTGCCTCACAAAGGCCAGTGAGCCCCAGCCGCAGA 

CCAGGGAAGCCAGCAAATTAGGAATTTTCTTCACAAA 

G1T1TGAGTAGCT 


SEQID 
NO: 35 


DG5S881 


168395631 
168395815 


TCTCCCATCTCCTCCCTAGCTATCCCCTGCCTGGTAATC 
ACTG Tili. u lUC-rcrATTTCTGTACAGTTGGGTTTTTGT 
TTGTTTGTTTGTTTTTAGATTCCACATATACTTGAAACA 
ATGCAATC1 1 lTl'CmOTrGTCTGGTTTATrrCACTT 
AACACAGTGCACTCCAGGCTCGTCTATG 


SEQID 
NO: 36 


D5S2043 


168499686 
168499956 


CCAGAAAGCTTCACCAAGGGGCGAGCATGATTATGCA 

TGAGCTTCTTAAATCTGGACTTCCCGACAGCTTCTCAT 

GACAGGTCTTCTGTGGAAGACTCCTTAGATTCAGACCA 

TCAGGCCTTTNAAAAGCACAGGAACCTACTTTACCTCG 

CCCAACTCTACGGATGGGATAGGNACTTACAAGGACA 

TTTCCTCATTGGATTCCAATGTTCATTCTCCCCTTCTCT 

CTCTCAATTAATCTCCCCCTCTTCTCTTTCTATCTACAC 

ACACACACACACACACACACACAGAGAGAGAGAGAG 

AGAGAGAGAGAGAGAGANAGAAACAGCTTCTTCACA 

GCGGGAAGCAGGGGAAGGGTATCTATTTCCGGCAAGA 
TC 


SEQID 
NO: 37 


DG5S1058 


168554788 
168555167 


AAAGTCAGCTAGGGCGACTGAGCCAGAGAGATGGGGC 

ACACAGCAAGAGGAGACCTGACAAAGTGCAGGTGTGT 

CCTAGAGAAGGCAAGCGAGACCACTGCATGATTGGAA 

TACCAGCCAACCCTTGCCTGTTCTTGTTCCAGCAAAGT 

GCCC i i i l AAAATAAATTTATGTATATAGTCTCTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTATAGACATAT 

AGAAATATATATTCCTAATTCAGAACTCATTCGTAAGT 

GCACACACTGACATGTGTTTCATGTTTCCCAATTTATC 

CCAGAGCCTATATGCAGTGTTTGGCTGCACAAGTAGG 

CATTAAATGCAACCACTGGGAATGAGAATGGTGGCCA 
CAAC 


SEQID 
NO: 38 


D5S2075 


168569742 

1 UOJ /V lit 


AGCTTTGAAATCCCATCCAAACTNATTGGCGTTTCAAA 

CTGAATCCCAGATGTTCACTTACTGAGAAATAAATGA 

ATGGCCCAATTCGTGGACTGANGCAGGGNCCCTCACA 

AATAGATTCCAGGTGTGTTGGCCTCTGGACCACTATCT 

TTCTCTGITITACATACACATACATACACACACACACA 

AAGTCCATCCTGAAAAGAATTCAACGTCATCTCCAAGT 
TAGAGCCAGTNTAGGATGAACAGAGGTAGTTACCTAA 
CACAAATAACATATTTTCAATTGTGGATGAAGGCAAA 
GGGCTCCACATTCACACTCTTGTGCCTTCAATA 


SEQID 
NO: 39 


DG5S883 


168592111 
168592265 


GGCATGGGATGTTTGACCTAAGAGATGGACTAAAGCC 
AATGAGTAAAAATGTAAAGCGTACTTAGTCAAATAAA 
TTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCCTTATTCA 


SEQID 
NO: 40 
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TCACTCTTGGGCCGTGATGATGATGAGGGAGAGGAGC 
AGT 




DG5S38 


168715977 
168716367 


TATTTGCCTGCCTGGGTTAGATGATTCTCCAGGCTTCT 
ACACAATTTTATGTTTATATGAAAATAGCCACAAAGG 
GAAAAGAGGACAATAAAACAAGAGATATGAATAATA . 

A TOT A TTPrT ATA PTTd AAA TTTOrT A A A A OA GTAG A TC 

TCAAGTGTTCTACATACACACACACACACACACACAC 

ACACAAAGGTAATGAATGAGATGATAGGTGTTAATTA 

AC1TGATTGTGGTAATCACTTCACAATGTATACATATA 

AAAACATCATGTTTTACAACCTACATTTATACAATTCC 

TCAATTATATATCAATAAACCTGGAAAAATAAAGATG 

TATA A A A A AOATTTAPA A ATA A HAT 1*1 "IT A A A AAA GO 

ATTGTGAGGAAACAAAG 


SEQID 
NO: 41 


DG5S37 


168770226 
168770418 


ACCAGCTAACCTGCCATGAGACTGTTGTGTAGCCATCT 

TCACCTCCTCATCTTCAGGGAAGGGGATGAAAATATCT 

GTGCACTGCAAATGTTAACTATATATACACACACACAC 

ACACACACACACACGTACAGTAGGCCCTCCATAACCT 

GAGGTTCCACATCTGCATA1"1"11ACCAACTCTGGTCCC 

X \J\^ 


SEQID 
NO: 42 


DG5S886 


168803195 
168803445 


TTGTTCCTGAATGGGAGGAGGACTGGTGAGTGAGGGG 

GAAAGAATGGAGACAGGACTGAGAAGAACCAGAAAT 

TAAAATAATAGTAGTAATAGCCTAACATGTACACGTA 

TATGAGATCTATCTATCTATCTATCTATCTATCTATCTA 

TCTATCTATCTATCATCTATATATCTATCATCCATCATG 

TATCTATCTATTTGCATATATAAGCTATAATATCTGGC 

IV^/XVJI 1 V^l AA X X vJ X X X 


SEQID 
NO: 43 


DG5S101 


168833451 
168833700 


CCAGGCTTGGATGAGAGAATAATCTTAAGGAAGTCAG 

CATATGTTCTAGAAACATTCAGAAGACAAAAGAGTCT 

GTTATGAAAGAACAAAGTATTTGTAATAATAAATTGA 

ATGTTACATGGACACACCCAGACATACACACACAGAC 

ACACACACACACAGTTTTTCTTCTCTCTCTCTCTCTCCC 

CACTCCCCTCTCTCATAClllGCAAACAAGCTCCTCAG 

CAGCTGGTAAGCTGTTCCCTGTCC 


SEQID 
NO: 44 


DG5S102 


168895047 
168895352 


TCCTGACTGCTCAAAGCTCAAGGTGTTGCCTTTTTCAA 
"a Tnrio a -rnr* a ata nrr^r a r^rr* A ttttpp a a in a tt a a ah 

CTAGAGAGAAGAATGAATGAATGAATAAATAAATAAA 

TAAATAAATAAATAAATAAATAAATGAGCAAAGTTAA 

TATTAGCTGGAAAAAATAGGGTACAGGTGGAAGGAAT 

GAACCCATATTGAGAGTCCACTATGTGTCAAATTCCTT 

GCATGGAATCTCTAAGGTCTGTCTAGCTTAAAAGCAAT 

GCCAGCCTTGCTATCTGTACTTGATGAGGAGATGGATC 

GGAA 


SEQID 

KID' AS 


DG5S39 


168920224 
168920577 


CC AAACTGC AAA CCCAAACTTCTAGAATGAATTCATGT 
GCAACTTATTCTAAAAGATCTATACACACACACACAC 
ArArAPAPArArAPAPAPArArA papa rrTrrTOTPfT 

ATTGCTCTTCACTACTTCCTTCATCTCTGTGCTAGAATC 

TGGGTTCATTTTTCTTCCCCTTGAGTAATTTATTATG^ 

TrilACAGTGAGTCTGTTGCTCAAAAATTCTTTTAGTAT 

TTATTTGTATAAAAAGTCITAATTTTGTCTTCATAT 

A1U11GTTTGACACTCTATTATAAATTGACTGTTATTCT 

CTTTCCATGl 1"11CCGGACATAGTTCCATTGTCTTCTGA 

CTTCCA 


SEQID 
NO: 46 


D5S1456 


168968063 . 
168968256 


TTCNACCTTATGGGTATATCGAATTGTAACCCCGTTGT 
AGGTCAAGGAGCATCTNCATATATACATACATAGATG 
ATAGATAGATAGATAGATAGATAGATAGATAGATAGA 
TAGATAGATTTAATTCTAAAlvriTTCCAAATACT 


SEQID 
NO: 47 
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AlllAAATGATTATAGTTTTACAACAATTTCATATATT 
NTATAGGTAGGAGAATTAGGGTTTTCCAGAGAAATAG 
ANNCAATAGGCTGTGTGTGTATATAANGATTTA>nTm 
AAGA 




JJUJO 1 uu 


169021609 


G Ii UGGCATGATGGTGTGTACCTATAGTCCTAGCTACC 

TGGGAGGCTGAGGCAGGAGGATCCCTTGAGCCCAGGA 

GTTTAAGACTAACAAGACTCCATCTCTGAAAAATAAG 

GCAAAAAAAGTATGAAGAATAAAATAACAATCACTTA 

CATTCCAACCACCTATAATTAATCATTGCCAACACCTG 

AGGATATTTGCTTCCAATCTACAAGACTGCATTATTAT 

TATTATTATTATTATTATTATTATTATTATTATTATTATT 

ATTATTGAGATGGGGGTTTCTCTTTGTTGCCCAG 


SEQID 
NO: 48 


DG5S40 


169067041 
169067434 


GCAGACAGGGTTCAAATTCCAGCCTCCCCTCATATTAG 

CTGTGTGATCTTAGGCAAGTTTATTCATGTGTAAAAAG 

AAATAATAACCTCnTCTCGTGGGGTTGCAGGTTAAACA 

AAAGAGTAGGTATTAAAACAACTAAAAGAGTATGATT 

GGATTGTTTATAACACAAAGGATAAATGCTTGAGGAC 

ATGGATCCCCATCITCCATGATGTGATTTTTATGCATT 

GTATGCCTCTATCAAAACATCTCATTTACTCCATATAT 

ATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

GTGTGTGTGTATGTGTATGCACACACTATGTGCCCACA 

AAAATTAAAAAi l l l AAAATTAAAAAATTTTAAAAAT 

AAACATGCTGCTGGGC 


SEQID 
NO: 49 


D5S504 


169142805 
169143006 


AGCTGCCAGCACACAAGGCCCAGAGGTACTTTATTGG 

ATGCCAAATTCTTTTAACACACCATGAGAAAAGAAGT 

TGACAACTTTCCCATGCAril-rGGAAGGTGTGTTAGAA 

CGATTCAACACACACACACACACACACACACACACAC 

ACACACACACACACATTTATTGGGTTGGGGGAGCCTT 

AAAACTTACAAATCT 


SEQID 
NO: 50 


D5S1961 


169173385 
169173631 


AAlTiereACCCTTTATCrc^ 

ACATGCCCCCCACCCCCCATGAATGATTTGTCTAATGC 

GTTGAATATGCACGTGTGGTTTTGGGTGCATGTATGTT 

TACACACACACACACAGACTCTCTCTCTCTCTCTCTCA 

CACACACACACACACACACACACACACACACACTCAT 

GCCTCTCCTTTGAAGAGGATCAGATATGGACAGCAAA 

GGGCATTAGCATCTTAGCT 


SEQID 
NO: 51 


DG5SI08 


169203019 
169203370 


GCTCCAAAGTTCCTGATTCCAAAGCCTAAGTTAAGCCA 

TCTTAACTTATATTTCAGGTGACTAGTGGAATTTTT^ 

GCCCAGTGTAGGTGGTAATGACGATGGAGATATGGCG 

ATGATGATGATGATGATGATGATGATGATGATGATGA 

TGGCAGTAGTGGTGGTGGTGTTGGTGATGGTGTTGAA 

GATCACCATGTCTGACCACTGTTGTCTGTGTCCTTTGT 

ACAATTGTTCAGATGCAGTGTCCTGGTTGTCACATAGA 

TGTCTCTGACTGTTTTACAGGCCTTCACCTACCACCAT 

ATCCAGGAGATCATGGTCCAGCTGCTGCGGACAGTGA 
ACCGGACAGTCA 


SEQID 
NO: 52 


DG5S41 


169277936 
169278300 


TTAACCCACTGCTCCCATTTTGCTGGTGGAAAACTGAA 

AACCAAAGAGATTAAGTCATTTACCCAAGGTCATGTA 
ACTAATATATTGAATCTC A O A TnTTTT a a tp. a ttttp a 

CTCATTTCCAATTTGCCTGGCTATATAGAGAAAATATT 

TGAGGAATTGACAGGGAACACACACACACACACACAC 

ACACACACACACACACACACACACAGAGGGAGGGAG 

GGGAGAGAGAGAGACAGAGACAGACAGAGACTGAAC 

AGATTATTTCTCCACTGATGTTCATTATTTAGATCTATT 

TTCAACATTTAAAGGCAATTGTCAGCATAGTCAATTCA 

GCCATTTTAAACCATCAAGGGCCAATG 


SEQID 
NO: 53 
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DG5S42 


169285983 
169286144 


GCAACCTATTTGTTAGCAGCACATGCGTGCGTGTGCAT 
GCACGTGGGCACACACACACACACACACACACACACA 
CTCTTTGCAGGGGAA rill GAGCCAGAAATTTATCTGT 
AGGCCCATATTCATTCCTTTTTGCACATTTCTATTGTGA 
CCTTGGGCA 


SEQ ID 
NO: 54 


L'UJj I \J 


169356318 


CAATTCCACACAGCTGGAGAGTAACAAAGCCAGAAAT 

CACATCCATTTGTGTGTGTGTGTGTGTGTGTGTGTGTC 

CAAATCCTGTGATTCCAGTGCCAGGATACACTGTCTTC 

CGTGTTCAACAGTCATGAAAGTATTTTAATGAACACCT 

GGCCCTGCAGTGCCTGATGTAGCAAATGCTGCAGATA 

CTCCACCCACCGACTCTTGGACCACCCAAAATCCACTG 

GCAGCTTCAGTGAGGCTTTCCTACll'Clll'Cl'riCCCTG 

GGCT 


SEQ ID 
NO: 55 


DG5S110 


169391090 
169391341 


CACTGGACTTGGAGTCAGGACATCATTTTAACAGCTTT 

ATCGAGATGTAATTTACATGCCATACAATTTACCCAAA 

GTGTACAATCATTGACTCTTAGTATGTTCACAGAGCTA 

TGTAACCATCACCACAATCAA ITTI ACAACATTTTCAT 

TACTCCTAAAAAGCAAACCTGCACCCCTTAGCCACTGC 

TCTGCCAACACACACACACACACACACACACACACGC 

GTGCGCACACACACCCAAACACTC 


SEQ ID 
NO: 56 


DG5S11 


169409260 
169409401 


CGCGTGTACCTCCACATAGATGTTTGCCAATGCCTATG 
CCCAAGACACACACACATACACACACACACACACACA 
CAGGATACATTCAAGCACACACTAATGTATGTGCACTT 
GCCTGCACAGAGTCCACATCACACAGGC 


SEQ ID 
NO: 57 


D5S1973 


169424532 
169424861 


AGCTTCTAGTCTGCTATGTTGCTAATC 
AT^llil'AAAACCATTTCTGTGAAATTATAGCCTCCTT 

actcccrtaccctgagtctggatgtttctgaagatgac 

tgatctctAcagtgagaaggccctgggaattgactga ; 

ctcactctctctgtctctctctctctcrctctcacacac 

acacacacacacacacacacactcatatacatacaca 

catagatacacatatacatgcatccacacatgcacac 

cctgggcacacccacacaccctacaactgcacatgca 

tgcacacacataatgttaactgaagg 


SEQ ID 
NO: 58 


D5S397 


169542970 
169543287 


AGC1111GGCTATGGAACCTTAGGCAAGATGTTCATAA 
ACCC11 1AATCTCTAGTGCCCTTGTTCATAAAAAGAAG 
TGAATCGGATCCCTGCAGGACTGTTTTTGTATTCAGTG 
CACAGGTGTGTGTG AAGACACCCAGCATGTTGCCAGG 
CACACAGAGATGTCTACC^GATACTTTTCTCTCCTCCT 
CCCCGCAAATACACACACACACACACACACACACACA 
CACACTCACACACTCTTA'rrrrGATCTTGGCCTGAGGC 
TGACAAGCCCCAGATTAGTGATCAGTGACAATTTCGG 
CI I 1A1CAGCT 


SEQ ID 
NO: 59 


DG5S115 


169586308 
169586550 


CGAGGATGCACACCTCATTAATTGAGGAGCTAGGATT 
TAGATCCAGAGCCTCATGATTCTAAAGCCTGTTTTTTG 
TTTGTTTGTTTGTTTG li' 1 " 1 GTTTTGGCCACACTAGGTT 
TCTAGAAACTTCCAGTTCCTTCTTAAAAGTCCTTTTTGG 
GCATTCCGGCCTAAATCCCAAAACTGTGGTCTGGGTAC 

AAGAGAGAATTAGGCCAGTGAGAAAAATTTAAACCAC 
CCTGCCCTCT A A A T 


SEQ ID 
NO: 60 


DG5S88S 


169653226 
169653848 


CTTGATGTCTTCTTTCACCATCCCCTGGCACCCGCCTAT 

ACATTTATTACTTGAAACAGACTGACCITTATTTGGTT 

AGGCCACTGTGGTCAGGTTTCTGCAACATGGGGTCAC 

ATGCCITCCCAACTGACACAAGTCTCAAGCTCCTTTTC 

TC1TC1TTTTATAACTTCTAGAAGCATAGCTTCTACCAG 

ATAAGGATCTAACCTTTTCAGTGGAAAACAAAAATGG 

CAAAGAAGTAAAGAAAGAAGAGAGAGAAAGAAGAAA 


SEQ ID 
NO: 61 
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GAAAGAAAGAAAAGAAAGAAAGAAAGAAAGAAAGA 

AAGAAAGAAAGAAAGAAAGAAAGAGAGAAAGAAAG 

AGAGAAAGAAAGAGAGAAAGAAAGAGAGATGGAGAG 

AGGGAAGGAAGGAAGAAAAGAAAGAGAGGGAGAAG 

AAAGAAGACAGGGAAGGAAAGGGAAGGGAAAAGAG 

GGAAGGGGAGAGGGGAGGACAAGGGAAGGGGAGAG 

GGGAGGACAAGGGAAGGGAGGAAGGAAGAAAGGAA 

GGAAAGAAGGC AGGAAGGAAGGAAGGAAGGAAGGA . 

AGAAAGGAAGGAAGGAAGGAAGGAAAAATAACTAGG 

GCCITTCACTTTTGCCTTCAATAGCAGAGTGGCCCTGG 

ATAT 




DG5S44 


169661202 
169661574 


TATTGGCAGAGGGTGAGTCCAGTGTATAAAAGCAACT 

ATATTTGTGCAATAAGGCAACCTCTAAACACAAGTTAC 

TACTTCATCTAATGCCACACACACACACACACACACAC 

ACACACACACACGAGTCATCTGTTCCAAGGCTGTTGCC 

TTTACTAAGTGATGCTATGTTGGTCCTTGAGGTGGTGC 

CTTCCTGAGGG1T1TCAAGCATAGCTTTGGCCATGCAC 

AGTTTTCTTCTTATACACACTCTGAGGAGCCCCGCCGT 

CACGGTAATGCACCTGCCTCACAAGCTGGTGGGCAGC 

TTAAATGAAATACACATTTTGCTCCAGGCCCAGCACTA 

GCTCATCAATGTGAGCTGGTGTTAGCCTCACC 


SEQID 
NO: 62 


DG5S45 


169693772 
169693912 


CAGTAGCCAGGAAGCTGAGGAACACACACACACACAC 
ACACACACACACACACACACACAAACACACCCCTTCC 
TGGCTCCAGTTCCGCACCACCCCACACCCCCAACACCG 
GAAGTAGATTTCTCAATAGGCAGGGCTG 


SEQID 
NO: 63 


DG5S46 


169702377 
169702678 


TTTGCCAGAATGTCCTCACACCAAATAGTGGACCCCTT 

Cri u rrGCTGATTTATCTGCTATTGTATAGGTGTATGTGT 

GTGTGGGTGTGTGTGTGTGTGTGTGTGTTAAGGCAGGT 

GGTAGTATGTGTAGGGTAGGGTTTCCCCAGTCACCTGG 

AGCCCTGAGTGCCTGCTTCCCTAAACTAGGCCAGTTTA 

GCTGACTGGCITCC1UUGTGTATTGGTCCATTCTGCATC 

AAAAGCATCTGAATTTTCATTCAATCTCTCTTCTGAAT 

TTTCACTTTTAAAAACCTGACCAGTCCCTTGTG 


SEQID 
NO: 64 


DG5S47 


169788696 
169788899 


CTCCTCCATGGTAGGGACTGGTTCTCTTAGGGCCCTGT 

ATCCTCAGGCCCAGCATGCITGGGAAAATGTTTGCTAA 

TGCTTTGTGACTCAAAAGGAATCACACACACACACAC 

ACACACACACAAACACACACACACAGTTTTTAATATT 

ATCAGTCATATCAGCCCCCTGAGGCAGCTGCTCTGTTC 

CAGACAAACCCTGTT 


SEQID 
NO: 65 


DG5S119 


169843903 
169844041 


GGGTACAGGAGAGTTGTGGTGGGCATTAGTACTACTC 
CTGCTGCTGCTGCTGCTGCTGCTGCTGTGTCCACTGTT 
AGTGACAGAAGTGGGAAAATATTTAAGTTGAGTTCAC 
ATTAGTGTTCCCAGTTTAGCGTGAGC 


SEQID 
NO: 66 


DG5S953 


169866165 
169866415 


CCATGAGTTCAGGCAGTGGGTTAAATAAGATTTCCCTT 

GAAGTCGAATGAAATCACAATGCACCACACACAGGGA 

CACACACACACACACACGCACGCACGCACATCAGACA 

CACACACACACACACACACACACACACACACACATAC 

ACACACACAGTCTCCCTGGGGCCAATCTACTGCCCCCT 

GAACCTCACCCATCAGCCAGGTGCCTGGCCCCGGGTCT 

GTCTCTTAGGGTTACATGCTCCCGG 


SEQID 
NO: 67 


DG5S955 


169951970 
169952619 


ACTTATGGAACACCTACTCAGTGCCAGGTATTGTTGTA 
GATGCCAGGAGTACAGCAGGGAATAAAACAACATCCC 
TGTCCTCGACACAAACACACAAGTAAATAGAGAAGGT 
CAGAGATAAATGCTGTGCAGGAAAACAAAGCAAAGTG 
AGGGATGGAGAGTGCGGAAGGTTGGGGCACTTTTGTT 
TCAGATGAGTGTCAGGGAAGCCCCCTTGGAGGAGGCA 


SEQED 
NO: 68 
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CTGTAAGGGCACAGAATCGAATGAAAGGAGTATGTGA 

AGGTGCTTAAATTGTTTCTGTTTGGTTTGGTGTGGTGT 

GATGTGGTGTGGTGTGGTGTGGTGTGGTGTGGTGTGGT 

GTGGTGTGGTGTGGTGTGGTGTGGTGTGATGTGATGTG 

GTGTGCGGTGCGGTGCGGTGCGGTGCGGTGCGGTGTG 

GTGTGGTATGGGTTGAGGCTGGCCTTAGGAGCCTGTTG 

GCCTTCCAGGCCAGTCCTGAAGCCCAGCCCAGAGCAC 

CAGACTCTGCAGTCAGTCAGTGGAGGGCCCACATCTC 

AGCCAATGCATGGCrri GGGTGGTGACTTCATCTCCCC 

TAGTGTTCCTTTCCCCCTCTGCAAAATGGGAATGGGGA 

TGGCTCAGAACTCCCAGCGGGAGTTAGGAGGAATAAT 

GTATAGGAAGTATGAGCAGAGTGCCTGG 




DG5S13 


169961410 
169961530 


TGATGTGCTCGTTCCCATAGCCCCGCTGTGTGTGTGTG 
CGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTTTGGTGGGGTGGGAGGGGAGGCAGAAGAGGAAG 
AGAGGGCA 


SEQID 
NO: 69 


DG5S123 


170015858 
170015997 


TGGTGATCAGCTCAGTGTCCTTGGAAAAGAGCAGAAA 
GTGGTATCACGAACATATCTTCTCCl"rrGCTTCCTTCTC 
CTCACTCTrTCATCATCATCATCATCATCATCATCAAAT 
ATGGATCTGTGAGGCTACCTCTGGG 


SEQID 
NO: 70 


DG5S124 


170041996 
170042336 


GGAGGAGAGACCAGCATTCACATTCAGTTATTGTTGTT 

TTAAATCCATTACGCACATACATAGGAGAAAATTTCA 

GCAACAGTCACCCTCTGAACCCAGTTCCTCAGTTCTCT 

CCAGAGGCAACTAAAATGCTCAATTATTAGTGTATCCT 

TTTGGAAATA'111"1ATGTATATGACAGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTTCCri"l'CCAATATTA 

AAATAATATTAACATTGGTAATAGTGGTACTAAACAA 

CTTAGGQTQlTlTnTllllJATITAATAGIATATrriTA 

GTATCri'ICCAGGAAAAGATACATGGATGTGCCACA 


SEQID 
NO: 71 


D5S625 


170105556 
170105787 


TCAATAAATGATTCTGGGGATGTGTCTGTCTGTCCATC 

TGTCTCTCTCNAGANACANATACACACACACACACAC 

ACACACACACACACACACATCCTGTTAGTTCTGTTTAC 

CTGGAGAACCTTGACTAACATACCCATTAAAACCAAA 

ATATGTCCITCAGGGTGTTAATGTTTGGTTGAAGAAAC 

ACAGAAGTTTAACAATTGTATCAGGCTGGGCACGGCC 

TATAATCCCAGCAri-l-l'GGGAGGCCACAATGAGNGGA 

TCACTTGAGCCCAGGAGTTCTAGACCAGCCTATGCAAC 

ATAGTGAGACAAAAAAATGAANAAAATTAGGGGTGTG 

GTGGAGCGCACCTGTAGTCCTAGCT 


SEQID 
NO: 72 


DG5S959 


170167429 
170167616 


GAGTTCTATGGAACAGCATTTATTGAATAATAACATTT 
CAGGAAAAAATATAAGC1T 1 ACTGTATATTAAAATAC 
ATATATACGTTTATATATTATATATTATATTATATTATA 
TATTATATATATTATATATATTATAATATTTATATATTA 
TATAGATATAAATCAACTACAAGATCCAGTTCAA 


SEQID 
NO: 73 


DG5S960 


170203240 
170203459 


TTGCCTAAGATCTAGGTGTTCAAAAGAATTTCTCCAAT 
CCACTTCAGCCTGTTATTATGTATGTATTCTGTTTTAAA 
TGAGAAACAAGAGTCATTTTCTCCAGAATAATAGAAC 
CATAGTGACACTTGAAGTAAGTCCAGTGGTCCTGATAT 

1 m a 1 A A nr« A A m a a f~t*t a A T~» 1 l rwn A iicri A rfw ■ » a O w T* A » I"" 1 ' A n»n a 

GATAATAATAATAATAATAATTATTATTATTATTATT 
TTATITI 'G AGACGAGGTCTGTATCTGTTG 


SEQID 
NO: 74 


DG5S16 


170280782 
170281084, 


ATGGAGAGACACGGAGTTGCTTGAGGGTACAGTGCCT 

GTAGATACTCAATAAATATTTGTTTAATTAAGAAAATT 

TCTGTTATTTGTGTGCTCATACATACCATTTCAGTCTGG 

TGAGTATTGTTCTTTCCTAGAGTTTACTl^l'AATCTTAA 

GTATTTTCCAGGTCCTTTGTTGACTTCTGTTTAAACCAC 

AGTACACACACACACACACACACACACAACTTTTGTG 


SEQID 
NO: 75 
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TACTATAATAGCTTCCCCAAAATTATAATTTAGTCATT 
GTGATGCAGATCTTCITCCAAGGCCTCTACTTTGG 




DG5S962 


170338421 
170338789 


AAACAAACAAACAGAAACACCAAAATGGATTCCAGCA 

TCTTATAAGTGCTTTCTCTTATGATCGAGAGTAAGACA 

AGCATGGCTACTCCCTTCTCTTCTATTAAATATTGTACT 

AGGGGTTCTATTGAGATAAATAGGCAAAAAACAAAAC 

AAAACAAAACAAAAAGGCATCCAGATTTAAAAAAAA 

AAGGAATCTAGGAATAAAGGGATTACATCTCTACTTG 

CAGATGACATGATCTTATGTATAGGAAATCCTAAGGA 

TCCACTGAAAAACTGTTAGAACTAATAACATCAGTAA 

GTTTGCAGGATTATAAGATTAATACAAAAACTCGACT 

GAATTTCTGTGCACTTGCAATAAACAACCCAAA 


SEQID 
NO: 76 


DG5S132 


170442700 
170442947 


TCTGCCCACACACTTTATGCITTAAAACAAAAGGCCAT 

GTTGAAGTTGTAGAACCAAATGATTGCTAATTACTTGG 

GGCGATACTAGTGATATATTATCTTACATACACACACA 

CAAAACACACACACACACACACACACACACACACACG 

GCTTGAGTCCAGCATGGCCTACTGATTTTAAAATAGGA 

AATGACAGTGTAAATGCCAGGATAAAGGACAAAGTGC 

TCTGACCTGTTGCCAAACCTT 


SEQID 
NO: 77 


DG5S136 


170469573 
170469843 


GGGTTTAGGACCCACGTATCTTTTGTTTGTTTGTTGTTG 

TTGTTGTTGTTTTGGAGTTCAACATGTTTATGGTGTGTA 

GCCATGGTTGAAAGCTTTTATTTTATAAGATAGACAAA 

GCAGGAATTATTATTCTCA T 111 ACAGGAGAAAAACA 

GAAGGGCTATGTGGTTTGTTAAAAGGCCACACAGACA 

GTAAAGAACACAGCCTTTACATGGTCAGCCTCACATTC 

TAGTACTCATTTTATTACACTGCTCT 

CTG 


SEQID 
NO: 78 


DG5S133 


170480360 
170480621 


CTGGCCTCTTTGCCCATTTTCTAATTC 

GG1111GTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTT 

TGCTGTTGTGACGTTAGAGTTCCTTGTATATTCTAGAT 

ATTAATCCCTTGCCAATTAATAGTTTGCCAGTAITTTCT 

CCCATCCTGTAGGTTGTTCACTCTGATGATTGTTTCCTT 

TGCTGTGCAGAAGGTATTTAATTTGATATAATCCCATT 

TATTTACTTTTGTTTCTGTTGCCTGTGA 


SEQID 
NO: 79 


DG5S17 


170499980 
170500284 


CCATCCAGGGTCTAACTCCAGCATTTGTATAAACTTGG 

ACAATACTTTTGCTACAGGGTTGTCATTGAAAGTATTG 

CCTCATTATATTTCTTAGTGGTCCCTGTATGAAGCCAT 

ATAAGAGAAACTTCITAATTTAGCACTAGGAAATGCTT 

CTGTTGACTTGAGATGTGTGTGTGTGTGTGTGTGTGTG 

TGTGTGTGTCTGTGTGTGTGTGTGTGTGTCTGTGTGTGT 

GTGTATTCCCCTAATTGATAAACTATAAAATAATCTTT 

CTCTTTTCACTTTGGCCATCTGGAAATTTGCCACCAA 


SEQID 
NO: 80 


DG5S137 


170644993 
170645364 


TGGCTTCCCAATCCTAGAAAAGGAAGAAAGCTGCATG 

TTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGT 

TGTTAGATGGAGTTTTCTGTCGCCCAGGCTGGAGTGCA 

GTCACATGATCrCAGCTCACTGCAGCCTCTGCCTCCCT 

GGTTCAAGCAATTCTCCAGTCTCAGCCTCCCAAGTAGC 

TGGGATTACAGGTGCGCACCACCACTCCAGGCTAATTT 

TTTGTAi 1 1 1 lAGTAGAGACGGGGTTTCACGATGTTGG 

CCAGGCTGGTCTTGAACTCCTGACCTCAGGTGACCCAC 

CTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTG 

AGCCACCGCACCTGGCTGAAAGCTGCAT 


SEQID 
NO: 81 


DG5S53 


170673106 
170673364 


GCCTTCGCAGATTGTACCTCTTCCITrCACCCTTCTCGC 
TGGCCTGTGCTTCTCTCTCCATCGTGGTCTCCCACGCCT 
TGGTTTCTCCTCCATCCCCATCCCCATCTTTCGTGAGCC 
CCTCCAACTCTCTCCCCGTGTTTTGTACGGTCTCCTGCG 


SEQID 
NO: 82 



iNSDOCID: <WO 20040411 93 A2_l_> 



WO 2004/041193 



-112- 



PCT/US2003/034681 







TTCACTTGATTTCCTCTCACCCACCCCCGCCCCAAACA 
CACAGGCACACACACAUACACACACACGCGCACACAC 
ACGGGCCTCTCGCACTCTCCTTCTCCT 




DG5S968 


170675807 
170676033 


TGACTCITGGCCTCTGTGTGTCTCTGGGTTTCT 
CCCTCCTCTCCACGGTCCTCTTGTCCTTTTGTCTTCCCT 
TTCITGTTTCTTG AATCTCTTTGCC 111 ATGT ATCTGTCT 
TTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 
TCI 1 1 CC ill CTCCCTGACTCCCTCTCTCCCTCTTCCAG 
GCCCAGCTCCCAGTAGCTCCTAAGGCAAA 


SEQID 
NO: 83 


DG5S904 


170735417 
170735632 


CA'lll'GGUAlAAATGTTGTCTTTAGlTITCAACrACTTT 

TTCTTTGGCTTATTCTC I'C 1 CTCTCCCTCCCTCCCTCTCT 

CTCTCTCTCTCTCTCTCTGTGTGTGTGTGTGTGTGTGTG 

TGTGTATTCATGTTTTCTTAATCTATCTGAATTGTTGTG 

TCGGTTTTCCATGCGAATTTCCAGTTACCTCCACAGTA 

TTCGTTTCAGAATGCTTCCT 


SEQID 
NO: 84 


DG5S906 


170820130 


TTGCAGTTTCATGAACCAAGTATTACTGCCTCAACAAT 

TAAAAACAACAGACAAATTATTTAAAAAACCATGAGG 

CGAGTGGTGGCTGGTGGCTGGTGGCAGGGCGGGGGCA 

GGGTGGCCTCTGTGTCTCATGCTTTCTGGTTGGTCTGT 

GOTU1T1 GCACTGAGAGCTAGGGCCTTGCACATTCATT 

CATTCATTCATTCATTCATTCATTCTTTGAATTCAACAT 

TACTATGCACCAGGCGCTGAGAAGGCAGCCTTAGACA 

GATGGAAATCCTTGCIl'rCCGGGAGATTCCATTCTAAT 

GGGTCATTGATTCAGTGGCCTCTTCAGTCATTTGTTCA 

TATGCATTTACTCGTACCTCTCATGTGCCA 


SEQH) 
NO: 85 


DG5S141 


170910447 
170910786 


AACTGAACCTGGGCTGTGTCAGTCTAAGACTTATGCTT 

GGAACCTGTAAGAAGAACAAGTGTGCGTGCATGCATG 

TGTGTGTGTGTGTGTGTGTGTGTTTGGAGGGATAGTGA 

AGTTTTTTCCTACAGCTACAAATAGAACATGCTTTC^ 

ATACAATTGTACTAATCAATATTATTTCCTTACATTATC 

TCCAGCCATTTCCCTATAATTAGACATTCAGATTATTT 

CAAGGTTGTTATTCCTATAAACAGTGATGTGATGAATT 

TTTTAAAGTTGGTTCCTCACATCCGTCTGTTCTTGTA^ 

TGTATCCATCATAGAACTGGACCACAAAGGTTGG 


SEQID 
NO: 86 


DG5S909 


170941109 
170941259 


GGGAGTTCTCTTTCTTCAAGTGCTTAGGGGAGAAAATA 

AGTGAGAAAAAGAGAGATAGAGGAAGGAAGGAAGGA 

AGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGA 

AGAAGAAAAGGAAACGAAAAGGAAAGGAAAGGAAG 

AAAGGA 


SEQID 
NO: 87 


DG5S910 


170946679 
170947010 


TCCATGTTATTCTCTACCTGTTGGTTCCTTCCTCATTGA 

AAATTGGTGTATAGATGGTAGATAGATAGATGATAGA 

TAGATAGGTAGATAGATAGATAGATAGATAGATAGAT 

AGATAGATAGATAGATAGATAGATTTTTATTTTTGGTC 

TATCTCCTTTACTAAACAGTAAGCTCCATGAAAATATG 

GATCATCACTGTCTTATTCACCATTATATTCTCAGCAT 

ATGGTATTGTCCTGGTATAGAATAGATTCTCAATAAAT 

GCTTGCTAAATGAATGCATTCATGAGTGAGTGAATGA 

ATGAATATGCGAGTGGATGAGTGTGTGGA 


SEQID 
NO: 88 




170986066 


AAACCAGGGATTGAAACAGGATCTGGCATGCAATGGG 

CTGGATGGGTGAATGGAGAAATAGATGAATGACAGAT 

GGATGGACAAACAGATGGAAGGAAGGATGGATGGAT 

AGATGGATGGATGGATGGTTGGATGGATGGATGGATG 

GATGGATGGATGGATGGATGGATGGATGGATGGACAG 

ATGGATTGGTTGGTAGATGTGTGGATAGATGGATGGG 

TGAACAAGCGAGTAGATGGATGAGTAAATGGCTAAAT 


biiQ ID 
NO: 89 
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CTGGTGCTTTTCTTCCAGAATCCTGGATTCTGAAGGGA 
GGCTTTGCAGCCCTTCCTCGTGGATCACTTGCTCTG 




DG5S143 


171018986 
171019237 


GGCTGAATTACTGGGCATGrriCTGAGAAGAAAGAAC 

TTCTAll"l"rAATTATATATCTACAGAAACCAAATTGCC 

TGCTTACAGTTTTACATGTCTGATGATTGGAAGTTTTT 

GTTTGTTTGTTTGTTTGTTTGTTTGTTTCCACA 

CCTCTGACTCCATATATTTCAAACTTTGTTCCTCTTCCA 

CTACCCACATATTTCTGATGTGAGACATTCTAGAAAAA 

TTTCATATTGCAAGACGGCTTC 


SEQK) 
NO: 90 


DG5S513 


171039003 
1 / lujyjoo 


TTGGCAGGATTCAGTTCCTCATGGGCACCAGACGGAG 

AGTCTCAGCAAATCACTAGCTGGTGACTGCAGCCACC 

GCAGTTCnTTATCACGTGGTTCTCTCCATAGGGCAGTT 

CACAGTGTGGTAACTTGCTTCATCAGAGCAAGCCAGG 

AAGAAGACCCAGAGACAGACAGAGAGAGAGAGAGAG 

AGAGAGAGAGAGAGGAGGCCTCTGAAAGAGAGAGAG 

AAGAGAGAGAGGCAAAGAGAGAATGAGAACTCCAGA 

AGTCACTGTCITTTATAGKDCTAATCTTGAAAGTGACAT 

TCATCACITrCACTACATTTTCTTCCCCAGCAGTGCTCA 

GTGGGAGGGGATTATACACGGCCATGGAT 


SEQID 
NO: 91 


DG5S145 


171040948 
171041151 


GCACATTTGCAGAGGTTTGAGGTCCCATCATTAGCCAT 

GCTTCTTGGTTCCTGCACTATGAGTATACGTATGTGGG 

CTGATGGCCTCATTCACTGGATACACACACACACACAC 

ACACACACACACACACACACACACACCTCACCAGGGA 

CTTGGGAGTATCTAAATGTTTGAGAATCATAGAGCAG 

GGAGACATCCAACAC 


SEQID 
NO: 92 


T\/TO 1 A C 

DG5S146 


171073796 
171074122 . 


GGGCACATACAGCTTTCCTTGCAGGAAAAAAACCTGC 

TTAACITTGTTTATTATATATTATTTGATCTGTGCTTCA 

TATATTATTCATATATTATTTGATCAAGTTGCTTCATGT 

ATTATTTGATCAAGGAATCATGTGTGTCTACAGCACCT 

ATTAAAATTCCCTGGCACTGAAATTCTGTAGAAAACCA 

TTTAGGAAAAGTTGATCTAACTGTATAATTATTAGTAA 

AACATATACACACACACATACACACACACACACACAC 

ACACACACACACCACAAGCAAACAAAAAAAAACCAC 

CTTAATGGTCTCCTAACCAAGGCA 


SEQID 
NO: 93 


DG5S147 


171107565 
171107831 


GCATGTTCGCCACAGAGATTCAATTAATTTAAATAGGT 

AGAGGACTTGGGGCAGTGCCTAGGACAACATTACACT 

CAGGGATGGTGATGATGATGATTATAATGATGATGAT 

GATGTTGATGATGATGATGTTGATAATGATGATGATGA 

TGATGATGATGATGATCATGATGATAATGGAAAAGAA 

GATAGAGGAGGTAGAAGAGGAGACAATCATGATGTTG 

GAGGTAGACTCCAATCTTCAGAATCAGAAGCTCAGGG 

TTGGA 


SEQID 
NO: 94 


D5S462 


171134297 
171134396 


AGCTTAATCTATTATTTNAGAGGCAGAAAGTTAACTTG 

CTTATCCTGAAAAGAAGTGCAAATATATCCCAAAAGT 

GCCATTCTTTCATTCATCCACTCAAACAGATACACACA 

CACACACATACACACACACACACACACACACCCTTTTC 

ACCCCTTGGTAGTGTACAGTCTCTGAGTTGTAAAAAAT 

AGTCATTNCTTTCTGCTTGAAAGACTGTATTAGCT 


SEQID 
NO: 95 




171 140Q7S 

1/1 l*T\JJ' / J 

171141303 


ULAUOA 1 (jrA 1 LL 1 A 1 UAA 1 LL 1 TATAAAAGGGAGATG 

GGAAAATTTCACACACACACACACACACACACACACA 

CACACACACACACACACAGATAGAGACAGAGAGAAG 

AGGATAAGGCAATGTAACCATGGAGGCAGAGATGGG 

AGTGCTGTAGCCACAAGCTAAGGAATGCTGGCAGCCA 

CAGATGCTGGAAGAGTTGAAGAATGGGTTCCCCCTGA 

GGGAGCACAGCCAGATGCATGCTTTGAGAGTTCAGCC 

CAGTGCTACTGACTTTAGACTTATGGTTTCCAGAACTA 


SEQID 
NO: 96 



NSDOC1D: <WO 2004041 193A2J_> 



WO 2004/041193 



-114- 



PCT/US2003/034681 







CAAAAAATTAATTTCTGTCGTTTCAAACCATPP 




DG5S9I4 


171219902 
171220159 


CAAACGTCGCTGACCTGAGTCTGACCTGGGCTGCCTCG 
TGTTACCAACATGAAAAGGGAGTGAGAAAATCTGAGG 
CCAATTAACTTCTCTCCCTCTCT 

TGCCCACCTCTCTCTCTCTCTCT 
TCTCTCTCTCTCTTTTTTCC^ 

ATGATGAAATTTCCTG A A AC A A A A A rTPf^P A nrprnT 
TCAATAAAATGCTTTCGCCTTTGGTG 


SEQDD 
NO: 97 


DG5S150 


171232854 
171233077 


ACAGTTGCCATTTGCTCATTTAAAATGTAGTGAGGTGT 

TTTAAAGAGGGTTTGTTCAATTTACCAAAAAGGGAAA 

AAAAGGGAAAAGAAGAAACTTATTGTTGAACGAACAC 

ACACACACACACACACACACAAAGAGCCTGGCTTAAT 

TTAGGGATAAAGCAAAGAAGTCAATACCCCCACATCA 

ACTATTGAAACCTAAGCTATTGCTGGAGTTGACAGCG 


SEQH) 
NO: 98 


D5S429 


.171276128 
171276490 


AGCTCTN CCTAGCATTGTTTTCCTTTGCTTCATTTCCTC 

TTAAATGTGTTGGATGCACTTNGTTCCTGCTAACTAAT 

CTATCTTTNCAGTTTCAAATCAAATGAACCCAGAGAAT 

TTAITr 11 ACATTATTATCTTCAGATTTAGATTTGTT^ 

GC1111AATCCTGTCTTCATGAAGGGGAAAGCCATGTG 

TACCAGCATGGTTGATAAACCACCAAATCGTGAAACT 

TTGCTTGCTCCCCAAACCCCCAACCACACACACATACA 

CACACACACACACACATACACACACACACACACACAC 

ACACACACACACACACACAACCTGGGAAATTGGGNAG 

AAAACTGGCAAACCTTAAACTAG 


SEQID 
NO: 99 



5 



10 



15 



INSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-115- 



PCTYUS2003/034681 



Table 7 The DNA sequence of the microsatellites employed for the association 
studies across KCWP1 (including Build 33 locations). 



NAME 


POSITION 


SEQUENCE 


SEQ ID NO 


DG5S1173 


169653708- 
169653840 


AAGGGAAGGGAGGAAGGAAGAAAGGAAGGAAAG 
AAGGCAGGAAGGAAGGAAGGAAGGAAGGAAGAA 
AGGAAGGAAGGAAGGAAGGAAAAATAACTAGGG 
CCTTTCACTTTTGCCTTCAATAGCAGAGTGGCC 


SEQ ID NO: 
100 


DG5S44 


169661202- 
169661574 


TATTGGCAGAGGGTGAGTCCAGTGTATAAAAGCAA 
CTATATTTGTGCAATAAGGCAACCTCTAAACACAA 
GTTACTACTTCATCTAATGCCACACACACACACAC 
ACACACACACACACACACACGAGTCATCTGTTCCA 

AGGCTGTTGCClllACTAAGTGATGCTATGTTGGTC 

pTTnA nnTncTnprTTPPTn a rzrzni th * i a a ns~< a t* 
^1 1 vj^-vvjo i vjvj 1 UL/L 1 1 L^L, 1 Ij/vLjvjvj 1 1 1 X L^AAlJV-- A 1 

AGCTTTGGCCATGCACAGTTTTCTTCTTATACACAC 

TCTGAGGAGCCCCGCCGTCACGGTAATGCACCTGC 

CTCACAAGCTGGTGGGCAGCTTAAATGAAATACAC 

ATTTTGCTCCAGGCCCAGCACTAGCTCATCAATGT 

GAGCTGGTGTTAGCCTCACC 


SEQ ID NO: 
101 


DG5S45 


169693772 - 
169693912 


CAGTAGCCAGGAAGCTGAGGAACACACACACACA 
CACACACACACACACACACACACACAAACACACC 
CCTTCCTGGCTCCAGTTCCGCACCACCCCACACCCC 

G 


SEQ ID NO: 
102 


DG5S46 


169702377- 
169702678 


TTTGCCAGAATGTGCTCACACCAAATAGTGGACCC 

P" I w J PTTTTnrTn A TTT A TPTPPT a ttht ata r^nTy-" t 

ATGTGTGTGTGGGTGTGTGTGTGTGTGTGTGTGTTA 

AGGCAGGTGGTAGTATGTGTAGGGTAGGGTTTCCC 

, CAGTCACCTGGAGCCCTGAGTGCCTGCTTCCCTAA 

ACTAGG^CAGTTTAGCTGACTGGCITCCTTTGTGTA 

TTGGTCCATTCTGCATCAAAAGCATCTGAATTTTCA 

TTCAATCTCTCTTCTGAATn^ 

TGACCAGTCCCTTGTG 


SEQ ID NO: 
103 


DG5S1178 


169745438 - 
169745539 


GTGCTCAATGGCTGTTGAATAAATAAATGAGAGGA 

GGAAAGAAGGAAACAAGGAAGGAAGGAAGGAAG 

GAAGGAAGGGAGGGAGAGAGGGAGGGAAGGAGG 


SEQ ID NO: 
104 


DG5S47 


169788696- 
169788899 


CTCCTCCATGGTAGGGACTGGTTCTCTTAGGCCCCT 
GTATCCTCAGGCCCAGCATGCTTGGGAAAATGTTT 
GCTAATGCTTTGTGACTCAAAAGGAATCACACACA 
CACACACACACACACACAAACACACACACACAGT 
11 11AATATTATCAGTCATATCAGCCCCCTGAGGCA 
GCTGCTCTGTTCCAGACAAACCCTGTT 1 


SFO TD NO- 
105 


DG5S1592 


169794522- 
169794686 


TTGAGCTGTTTGGCCTCAATGGCATTTTATCTCTCT 

CTCTCTCTGTGTCTCTCTCITrCTCTrillllllllTT 

CACATTGAGCCATCITCTTACAGCTGAGGTTTTCAT 

ATAAAAAAGCAAGTTGCTGGTTTCTCTTTAAAAGT 

AGGGCAATCTGGCAGTTCT 


SEQ ID NO: 
106 



;NSDOCID: <WO 2004041 193A2_L> 



WO 2004/041193 



-116- 



PCT/US2003/034681 



DG5S119 


169843903 - 
169844041 


GGGTACAGGAGAGTTGTGGTGGGCATTAGTACTAC 
TCCTGCTGCTGCTGCTGCTGCTGCTGCTGTGTCCAC 
TGTTAGTGACAGAAGTGGGAAAATATTTAAGTTGA 
GTTCACATTAGTGTTCCCAGTTTAGCGTGAGC 


SEQ ID NO: 
107 


DG5S955 


169951970- 
169952619 


ACTTATGGAACACCTACTCAGTGCCAGGTATTGTT 

GTAGATGCCAGGAGTACAGCAGGGAATAAAACAA 

CATCCCTGTCCTCGACACAAACACACAAGTAAATA 

GAGAAGGTCAGAGATAAATGCTGTGCAGGAAAAC 

AAAGCAAAGTGAGGGATGGAGAGTGCGGAAGGTT 

GGGGCACITTTGTTTCAGATGAGTGTCAGGGAAGC 

CCCCTTGGAGGAGGCACTGTAAGGGCACAGAATC 

GAATGAAAG 

GAGTATGTGAAGGTGCTTAAATTGTTTCTGTTTGGT 

TTGGTGTGGTGTGATGTGGTGTGGTGTGGTGTGGT 

GTGGTGTGGTGTGGTGTGGTGTGGTGTGGTGTGGT 

GTGGTGTGATGTGATGTGGTGTGCGGTGCGGTGCG 

GTGCGGTGCGGTGCGGTGTGGTGTGGTATGGGTTG 

AGGCTGGCCTTAGGAGCCTGTTGGCCTTCCAGGCC 

AGTCCTGAAGCCCAGCCCAGAGCACCAGACTCTGC 

AGTCAGTCAGTGGAGGGCCCACATCTCAGCCAATG 

CATGGCITTGGGTGGTGACTTCATCTCCCCTAGTGT 

TCCTTTCCCCCTCTGCAAAATGGGAATGGGGATGG 

CTCAGAACTCCCAGCGGGAGTTAGGAGGAATAAT 

GTATAGGAAGTATGAGCAGAGTGCCTGG 


SEQ ID NO: 
108 


DG5S13 


169961410- 
169961530 


TGATGTGCTCGTTCCCATAGCCCCGCTGTGTGTGTG 
TGCGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTTTGGTGGGGTGGGAGGGGAGGCAGAAG 
AGGAAGAGAGGGCA 


SEQ ID NO: 
109 


DG5S123 


170015858- 
170015997 


TGGTGATCAGCTCAGTGTCCTTGGAAAAGAGCAGA 
AAGTGGTATCACGAACATATCITCTCCTTTGCTTCC 
TTCTCCTCACTCTTCATCATCATCATCATCATCATC 
ATCAAATATGGATCTGTGAGGCTACCTCTGGG 


SEQ ID NO: 
110 


DG5S124 


170041996 - 
170042336 


GGAGGAGAGACCAGCATTCACATTCAGTTATTGTT 

Gl I'l'l AAATCCATTACGCACATACATAGGAGAAAA 

TTTCAGCAACAGTCACCCTCTGAACCCAGTTCCTC 

AGTTCTCTCCAGAGGCAACTAAAATGCTCAATTAT 

TAGTGTATCCTTTTGGAAATATTTTATGTATATGAC 

AGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

TTCC 1"1 " 1 CCAATATTAAAATAATATTAACATTGGTA 

ATAGTGGTACTAAACAACTTAGGGTGTTTTTTTTTT 

CATTTAATAGTATATTTTTAGTATCTTTCCAGGAAA 

AGATACATGGATGTGCCACA 


SEQ ID NO: 
111 



WSDOCID: <WO 2004041 193A2_I_> 



WO 2004/041193 



-117- 



PCT/US2003/034681 



D5S625 


170105556 - 
170105787 


TCAATAAATGATTCTGGGGATGTGTCTGTCTGTCC 

ATCTGTCTCTCTCNAGANACANATACACACACACA 

CACACACACACACACACACACACATCCTGTTAGTT 

CTGTTTACCTGGAGAACCTTGACTAACATACCCAT 

TAAAACCAAAATATGTCCTTCAGGGTGTTAATGTT 

TGGTTGAAGAAACACAGAAGTTTAACAATTGTATC 

AGGCTGGGCACGGCCTATAATCCCAGCAT1T1GGG 

AGGCCACAATGAGNGGATCACTTGAGCCCAGGAG 

TTCTAGACCAGCCTATGCAACATAGTGAGACAAAA 

AAATGAANAAAATTAGGGGTGTGGTGGAGCGCAC 

CTGTAGTCCTAGCT 


SEQ ID NO: 
112 


DG5S959 


170167429 - 
170167616 


GAGTTCTATGGAACAGCATTTATTGAATAATAACA 

TTTCAGGAAAAAATATAAGCTTTACTGTATATTAA 

AATACATATATACGTTTATATATTATATATTATATT 

ATATTATATATTATATATATTATATATATTATAATA 

TTTATATATTATATAGATATAAATCAACTACAAGA 

TCCAGTTCAA 


SEQ ID NO: 
113 



Table 8: The Build 33 location and size of KCbJPl exons. 

5 



EXON 


START (NBCI33) 


END (B33) 


Size (bp) 


1a 


169716298 


169716511 


214 


UTR 1 


169848417 


169848523 


107 


UTR 2 


169861083 


169861154 


72 


UTR 3 


169864589 


169864679 


91 


UTR 4 


169867066 


169867173 


108 


1b 


169867120 


169867180 


61 


Ins-r 


170075401 


170075433 


33 


2 


170081305 


170081429 


125 


3 


170082868 


170082937 


70 | 


4 


170084380 


170084450 


71 


5 


170085260 


170085367 


108 


6 


170095347 


170095451 


105 


7 


170096383 


170096445 


63 


8 


170098306 


170098353 


48 



10 



15 



'.NSDOCID: <WO 2004041 193A2_I_> 



WO 2004/041193 



-118- 



PCT/US2003/034681 



Table 9. The Build 33 location of SNPs found across KChJPl after the first round of 
sequencing that was limited to the exons and flanking sequences. 



START (B33) 


MARKER 


VARIATION 




TTPP #*1 a 94QQ94 


Uu 


1607 16900 


T<TPP pIq 9^H097 




1 ^071 fSKl\ 
/ IOJ>Zl 


iss^r _e 1 a__z;> uu4y 


A/C 


1U7 / *+UDOO 


Ti^XTD 9/1999 


A/G 




T^XTD 9 /1 9 CO 


a ir^ 
A/G 


1 ^0741 1 79 
I 07 /HI i /Z 


JsJNJd_z4 /2o 


G/T 




THvTD 9 O O H C 

jsJNr>_zyby*> 


C/T 


1 /^Q7A7Q^1 


JsJNo_J 14y / 


A/G 


1 ^07^ 1 *;$?*2 

10^7 / JlOOJ 


T7"\TD 

PwJNr>_J jzyo 


A/1 


ioy ID I f DD 


TTXTT) OCO *7 A 

JsJN±>_J3J70 


C/G 


ioy /d i /dd 


tr\Tn Q CO Aft 1 


A/G 


107o4o4/0 


KCir_U iKl_iozzOo 


C/T 




JvG.r__U IKl__ibzz7z 


A/C 


1D7001jOO 


rLGr_3 U 1 Rz__3 y 5 06 0 


A/G 


ioyo04 / du 


KCr_3U 1R3_39o480 


C/T 


1 £QB£/1Q7^ 
lOyoO^q /D 


ivUr_i U 1 K3_^3 y 0 oU 5 


C/T 


1 /^QS^^l CI 
IO70OO1 oi 


JvUr_e 1 d_3 y y y 1 z 


G/T 


1 70OR1 9Q1 


VfT) 1 1 C9 


C/I 


170081464 


Jvv^Jr_i JZ4 


G/C 


17008147^ 




a ir^ 
A/G 


1700R97RR 

JL / \J\JO£* I OO 


T^r^T> 9/C/1Q 
JVV^ir zoH-y 


C/l 


170085097 


KCP 4957 


C/T 


170085115 


KCP_4976 


C/T 


170085151 


KCP_5011 


A/T 


170085191 


KCP_5051 . 


C/T 


170085217 


KCP_5077 


A/T 


170085342 


KCP_5202 


A/C 


170095344 


KOM5204 


C/T 



.2004041 193A2_I_> 



WO 2004/041193 



-119- 



PCT/US2003/034681 



170095540 


KCP_15400 


C/T 


170096291 


KCP_16152 


A/G 


170098209 


KCP_18069 


C/T 









Table 10. The DNA sequence of the SNPs identified across KChlPl. 



NAME 


SEQUENCE LISTING 


SEQID 
NO. 


KChlPl 


See FIG. 1 


SEQID 
NO 1 


SG05S872 


TGGCTGTCCCCTCTGCCTGGAGCAGGCTTTGCCCAGATGTCCTC 

CTGGCTCACTCCCTCACCTCCTTATGTCTTGACTCAGAGGTCAC 

CCTTCCAGATTAGACTGCCTGACCCCTTCTGTGCTTTCTGTTTT 

CTCCTTATTACAAATGAATCTGCACCATATTTCACTGATTGTGT 

TTGCTGCATGCATGAGGGCTCACATAAGGATGTGCTTTTTGTCC 

ACTTTGTTCATTGCTGAATCACTAGCACTGACAGCTGTACCTGG 

CACAAACTGGGTGCTTAAGAAATATTCTTGAATCAAGGAATCAA 

TAAATGAATGTTATAGAGAAAGCAGGAGAATAGATGATAATTGA 

GAAAACTGAAGCCCAGAGATGGGAAGTCACTGGCCCCATGTCAC 

ACAGCAGCAAATGCAGAACCGGTCCTGGAACTTCAGCCTCTCAG 

CCCCGGCCCTGTCCTCTCCTGTGCTTCfCACCACTTTATGTAAG 

TTTTTTCTTTATTTGTGGAGCTCTCAGCAGGCATTTTTCTCTCT 

GTGCTCAGTTGGCATTTTTCCCTTGAACCAGCTGTGTCTTCACT 

CTCTTC CC CATTTTCTCCAGAATATGTTCTTCTGTTTAACTGAA 

TGTTCTCTTTTTCTGCAGGTCTGGCCCAACTGCAATATCCAGAG 

ACTTTTCGGTGTCATATGAAAGAAAAGGAGCAGGAAGCCAAGAT 

GCCCCACCTGGCTTCTACATCAGGGTGATCTGCATAGTAAGATG 

CAAAGACACTGACATATGCCTGGGGGTAACGAGGGCAGTGGGGG j 

GAGGGAGCTAAGCCAAGATAAGCCTCCTCCCCACCAAACATAGG 

TGCTACTGAGCAATGATAGGGGGCATGCTGTCTGCTCTGGTACT 

TGCGTAGGGAATGCTCTGAGAAAC CTC ACTAAATCTGC CCTCTA 

GAGTAGAGCAACCTGGGAGCTCAGGCTTCCCTTTCCTCTGTGTG 

ATGGGTTGGCGGTCCTTAGAGCCAGCCATTTC [A/G] TCCTGCT 

CCTTCTCTCCTCCCCTTCCTGACCAATAAAGATTGTGTGCTTCT 

GGCCAGTCAGCAGGGTGGGCTCTCACTCCATCCTGCCTCTGGTA 

TGACAGCACAATTCCCCTCATTCTTTATAATCATTATAAAATAA 

AATAACTACCTTTTAGAATACTTATTTGATATGAGGCACTTTGC 

AAACC C ACAGTCCTGCATATC C CATTTGAC ATATCAGGATGCTG 

GGCTTACAGGTTACCCCAGGGGTGGAGTTGGGCTCAATCCTAGG 

ATTGTCTGCATCTGATTCTGAAGCTTGTTTTCTTTTCCCCTATA 

CACAATCATTCATTCATTCATTCAGTAATTTTTAAATTGAGACA 

TACTATGTACCAGCACCTGTTCTAAGCATTGGATTATGGTGATG 

AATGAGGCAGACAGGGTCCTTCCCACAAATAACTAACTCTATTC 

AAGCAGTGGGAGAAAAAGCAATGAATGGGAAATAAATGCACAAA 

TCAAGTAATGTTGGATGGGACAACTGCTGTGGTCCCATTGAAAC 

AAGCCCAGAGTGAGCCCAGTGTAGGGACTTCTTCATTGACTGGT 

TGGGAATTGAGTGACAATCGGTTGCTGCATGCTGATGGGTGCCA 

AATACAAC CGTAAGGAAACACTCC CCTGGGAGGGAGGCGGGATC 

CAGGTTAGGAAAGAGCCTTGGATTGAGGCAGAGTGTCAGGAAGT 

GGGGAGGTACGCAGCTGACCTTGGAGAAJ\ATCCCTGAGTGGTGC 


SEQID 
NO. 114 

i 



tNSDOCID: <WO 200404 1193A2J_> 



WO 2004/041193 



-120- 



PCT7US2003/034681 





AGATCTCTTGAATCTCTGAGTGGCTCAGAGTCTTCCTGGAAATG 
CAGAAATCCCCATGCCACTTAGGGGCATCTTCATTCATCTCCAG 
CCCTCCTTTATTAAGTCATGTATACCATCTCCTCTCTTATGCTT 
AATGTCATGCCACTCTTCAATCCTTGTCCCTTCTTTCCCTCTGT 
GCCTGCTTGTGGTTTACTCCTGCTGACACCAAAGGCTGAGGAGG 
ATGAAAGAACAATTCCAGC CCTGAC 




SG05S873 


GGTAATTCTTAAGCTGGCTGGGCCTAAAACTGCAAACTGGTATT 

GGGCATGCCAGAAGGTAACCATAAATGGGCTATTTGGAGATTTC 

TAGGAAGAAGAATGACATTTTGTTTCATTCCATTCCATTTCATT 

TCATTCCATTAATACTAAAAATATTAACTAAAGCATCATTTCTA 

CTATATATCCAGAAGAGAACATGGTCTTAGGTCTTTTAATAAAT 

GAACTTCAGTTGCAAACTTTCTGCTGTGACGTTATATTTCTCTT 

TCCACCCTAGACCAGCCCCTAATGGGGCCATGAAGTCAGATTTT 

TGGTTCATGGTGTTGTCGGGGCAGCATAGCCCAGAATTCCACTT 

CCTTCCCTGAGGACACATTTATTCTGGTAGATGTGCTGTTTTCC 

ATTTAAATGTCCTTTGGC AATAAAAGAGCTGGCTCCAACAGCAG 

ACCACGGGGCTGGCTTTGTCGGCAGACACCACGTGTTCATGACT 

GGCAGCTTTGTCTGGAAGAGGGAGCTTTTAAAATGCAGTTCTAT 

GCTGACTCTTTGGAGTCTTCCCAGGAAGATAACTGCTATTGCAT 

TGCATGCTTAATTTAGAGCACCTATTTTTCCCTCTCCTTCAAGG 

TTTCTGTATATCTTCTCAGTTCATGAAATTAATTATTTGGGTAC 

AATAATTGTACAAAGGCACTTTATCAGACACTTCGTATAAATTA 

TTTCTCATTCTCAAGGCAACTTGGAAAGGTCAGTCTAGGGGTCA . 

GCTGCTACTTTTGGTGATCAGGCATCACCCCCTCCTTCCTCTTA 

GTACGTTATGACAGTGGCAAGTG AGCATTAC CTGTGGACCCCAA. 

AGGAGTTCATTTCCTTAGAGCCAGCCATTCCTCAGTTAATCTGG | 

TCTGTCAGACACTCTGTCCCAGGACACTGAGCCTTGAGCATGTG 1 

AAGGTGTGGGCTCTGCTGGGGGCTTGGCAGCCAGCACCTGTCTG 

TGTATCACCTGGCTCCTGCAGCGAGAAC CTGC [A/G] GTGTGAT 

TTCTGCAGCCTGGCCCTCTGAGATTCCATGGCTGCTGACGATTT 

TCCACTTTCCAAGACTGTTCACATTCCCAGCAATTCTGTGAGGC 

CCTGGCCTTCAAAGGTGTTCAATACATTGCTTTTTTTTTTTTTT 

TTTTTTTTTTGAGACAGAGTCTCACTCTGTCACCCAGCCTGGAG 

TGCAGTGGTGCCATTTCAGCTCACCGCAACCTCCACCTCTCGGG 

TTCAAGCAATTCTTCTGCCTCTGTCTCGCAAGTAGTTGGGATTA 

CAGGCACACATTGCCACTTACGGCTTTTTTATCATTATTATTAT 

TATTTATTTTTAGTAGAGATG CAGTTTTGCCATGTTGG C CAGGC 

TGGCCTTGAACTCCTGGTCTCAAGTGATTCACCCACCTCAGCCT 

CCCAAAGTGCTGGGTTTACAGGCGTGAGCCACTGTGCTGGGCCC ' 

CATTTGTTATTTAAGGGAGAGTCCGTTTCTGCTGTTTGTAACTA 

AGGACCTGTCTGATCTCTAGGAATTATTGACCCCAGTTTTCAGA 

TAAAGAAGTTAAGCTTG AGGTTAGAGCTTTTG AG CAAAAACTC C 

TCTCCTAGAGAACTCAAGTATCCAGGAATACTCGGTCAAGGCTG 

GGCTGGACCAGGTCTGTAATCCTGATATTCAGAAAAGGGATGAT 

TTCTCCTCTTTGGTTTGGTTTTCTCACTGAGGCCTGCACACCAG 

A.CAAAGACTGG CAGCTTATTC CTG AGTACAGTTCCACCAGGTAT 
GGCACACAAAGTGATATGAGTTAGAACACAGATGGATATAGATG 
TTTTACAAATGTAAGTTTGCATAACACACACACACACATTGCTA 
TGTGTTAGAAAAATACAATAAGCTCATCTAATTTATTATTTCAT 
GTGTCTTATTGCTCAGAAAGAGGAAAAGATTTTATTGAAGTTGA 


SEQID 
NO. 115 



tNSOOCID: <WO 200404 11 93 A2_l_> 



WO 2004/041193 



-121- 



PCI7US2003/034681 





GAAAAGAAATTGAATTAAAATAATA 




KCP rs.31 
5773 


AGAAACTCCGACTGTCTTTCAGCACACAGAAGACACTGTACTGG 

ACCCGGACATTAGGCAGACACCCACGCCTGACTTTCAGGAGAAA 

AGAGAACATGACTAACGGATATTCTTAGTAGATGGTTTATTAGA 

AAAGAGAACATCTTCCAGCATGTGTCCTGGGGTGATGGGTGTGG 

GAAGCACTCAGTCCATAGTCCTGGTCCCTGGCTTCCCCAAGCCC 

AGCACCATGAATGTACAGTGGAAAGCAGAGGGTGCAGCGTCTCA 

GAAAGATGCTTC CACTCACAAGGATTGGAGCTCACAAGTGAGCT 

CCATAACCTGCAAACCAGAGAAACCTGAGACACTGCCCCTTGGC 

CATTTTATCAACGGAGACTTTATTGTGATTATCCCGGCAGGGGG 

CCGAGCTCTCCTCTCTGCAACAGGAAATGCTCTTTAGTGAAAAT 

GCAGCATTTCTCCAAGGGTAAGAAAGCTGAACGCCTGCTTAGCT 

TATGAACCCTCAGTTGGCCTAGGTGGTGCAAAGACCCTGCTGTT 

ACTGCTTTGATCATCAGTACTGTGGACTGTACCAGGAGATCCCT 

GGGAATGTGCTCTGGGCGGAAGCAGCTTTTATCTTTGGCCCTCA 

CCCATGCTTTATATGGTGAGGTTGGGAAAATGGCACAAGGCTTC 

TCCTGAACCTCAAATCAACACCCTTGCCCCATTTAGATCCTATC 

TGGCTGTTTCTTGCTAATATTACTGCATCACTGCACCATCTTTC 

CTATTTCAGCAAAGTGGAGTCATGTGTGGTTTATGGGGTAGATG 

GACCCCAAAACTGATAATATGAATCAAGCTATGGTGTTTACTCC 

CTAGGAAATGCACAATTTTTCTGGAAACCTACAGAAGCTTCAAA 

TGCATTCGCCATGCAAAGCTAAGTCAGCAGAACAACCCGTTTGG 

CTTTGGAGGCTAGTTCAGTTCCGCGGACAGGGAGAAAGATGAGG 

CAGACTGTGGTTTTTCAGTTCCTGGAGCTTAC [A/G] GAGCTCC 

AAAGCTCCCTCTCTTCCCACCCTGGCTGCACTGTTCTTAATTTT 

AGATAATACCCTGCCTTCTCGTATTGCTGCTGAGCTCCTAGCAT 

CCTCAGTTTATCTGTCTGTGAAATGAAAAATCTAATGTTAAATT 

TTTTACCTATGGCATGAGAGAGATGGCTATGGCTCTTGTGAGCC 

TCTCTGCAGCCCCTCTTTTCCTTCAATCACCCTCTGTCTCTCCT 

GCCTTCTGCTTATTCTCTCTCTCCCCTCATCCCCACTTTCCCAG 

TGGGTCCTCTGTTCTCTTTTTTTTTTTCTTTTTAAATCTCTCTA 

TGC CTCCAGC CGAGAAGATAAAG AGTGTACATCTTTCTGGTTAA 

AAAGTTTTGCTTTGCAGAAACACAGCCAATTTATGATTCTGGCC 

TTCCCAGCTAGGGACAGTGTTCATTTACATTTAGGACCATGAGG 

AGAGAGGCTTAGCTGTGTGTTTCTGAGGCCGGAGAAAATTACAG 

TGATATATAACAGTGCTGCACTCATAGAGGf GCTGAGCCGGGGT 

TGGGCTCAGGCGGCCGCTAAGCTCAGAGTGGAAAGTTTCAGAGG 

GGAGG CAGAAAGGAGAGGTCTATAGCTC CTCCAGATTCTAGGTA 

TTAATTTACTAAGATATTCCTAAGCCAGAAAACAGAGACAGAAG 

ACAAAGAGAAAGAGGGAAGAAGAGCAAGACAGAGAGTTAGAGAG 

AGACAAAGAGAGAGAGTTAGAGACAAAGAGAGAGTGGAGAGGAG 

AGAGAGCAAATATTGAAAGGAAAAGGAAAAAGAAAGAAACCTGA 

CAGCTCATGAACTTTTTAAAAAGTTACAAATTAGATTTGAAGAG 

ATGGGCAGAGGTTTAAGATTTCTTCATTAGGCTGGGTGTGGTGG 

CTCATGCCTGTAATTGCAGCACTCTGGGAGGCTGAGGGTGGCAG 

ATCATCTGAGGTCAGGAGTTCGACACCAGACAGGCCAACATGGT 

GAAACCCTGTCTCTACTAAAAATAC 


SEQED 
NO. 116 


SG05S876 


TATATAC AACCTGGAAGCTCTTTTTC CAAC CATATCACAGACAA 
AbAAAl 1 ^A(jQjL.TTGTACAGGTGAAGGGGCCTGCCTTTCCTTTG 
CTCACAGGAATGTGAGGATGATACAAAAGTGAAGGATATTGGCA 
TTCTTCAGGCAGGGAGATAACCTGGACAGGGGTGGTGCAGCAGG 
CATGTGCATAAAAGGAGCAAGAGAAGCCTTCTCTGTCGTGAGCA 
AGCTTGCAGGCCAGATGGAGAAAAATGAAGTAAAGTCACCCCAA 
AGCCTGGATTCTCATCTGGAGTGCCTCTTGCCTCTTGCCCTTCC 
CAGAACGCTCCAGCTTGGCACTGGGCTGGAATTCCACTAAGAAT 
TGAGTTGATTTCGTCATCTGAGGCCCTGGGCACAATGACAAGGG 


SEQID 
NO. 117 



WSDOCID: <WO 2004041 193A2_I_> 



WO 2004/041193 



-122- 



PCT/US2003/034681 



- 


TGGTTTTCTCGGATCTGCAGTGAGCATTACACCAGAGTGTGGGA 
AACAGTGCCTACTCAGGGACCCCACTCTGGGACCCAGGGCAAAC 
TTGCCATCGTCTCCAGTCAGCTCATTAGCCGCCCAGGACTCTGC 
CAGCCCATCCAGGCAGTGATGTAATTACCAAAATGGAGATGAAT 
ATTTAAAGGGACTCTTACTTAACCGATATACTTCCTCTCCAAGT 
TCCCTCCTTCACCGGCTCTGGATGAATTTCTGGAGGGATTGCTC 
TGACATAGGCCCAGAGCTACCTGTGGTTTGACCTCATCATGAGG 
CCTTTCTTCACCCTTTCTTGGTGGCTTGCCTTGAGGGTGTTAGG 
AGATGGTCCATTGTCTGACTGTGAACAGCAGGGCAGCTCTTATA 
TTCTCCATCAATGGATCTCTGGGGACAAGACCCAGATGGGTGGG 
GGGACAGGGGAAGGAAACATAAAAGCCAAAGGGACTGGATACCT 
GTAACTAATTACCCCTTTACTGTTTCTGTCACCAGACCTTAGTG 
CCACAAAGGATTGGGGGTCATTTGTGACAATGTATGTTGTAAAA 
TGTAAAATGCAAGTGACCACAAATCTGAAAGC [A/G] GTATAGA 
GCTTTGGTTAAAATAATGCAGGCTCTCCACTGGCATTATTATTG 
TTGTTAGGAGAGTCTGGTGCTCTGTTCAAGGGCTTTTCTGTGCT 
ATGGATTATCTCTGTTTAGCACAAAATATCTTGTGTCCCTGGAA 
ACCCCTTAGTCCTGAGAAAACCAGGGCAGTTGGTCACCCCCCTG 
TTCAATGCAGGCATCAGTTCCACTAGGTAGGGGGTCTTAGCTGC 
ATTTTAAAGATAAGGAAATAAAGACTTAATGGGTTGGAATAACT 
GGGTTATGTGCACATAGCTAAAGAATGGTTACACAAACAACTTC 
AAGTC AAATATTAGACCTGCGTATTCCTAAAATCC CTATGGCTG 
TTTGCAATAACTTGAGGCCAGCCTCCCTCTCCTCTTTTCTAAGC 
CCTCTTTACCTTTCTGTGTCCTCTGATGGCTGTTGTTTATCAAG 
GCAACCATCGTGATTCATACCTCAAAGGACGCTTTGAATTCTAC 
TCCTATAGGCTCCAAAACCCTTATTATCCAGGTTCAGTATTGCT 
CTAAACTAGGTGAGGTCCTGAACAGACCCAGATTTCAAGCATAT 
TCAGGTGGATTTGTTTAACAGAGTGTGGCTACTGGAACATCTGG 
AGCCCAAAGTACACAGGAGGCAGGAGAGAGCCTACTTTCCTGAA 
GAGAGGGACGGGCCAACTGTCCGACAATGAGGAGGTGGGCATTC 
TTTCCTTTGTAAAACAAAAAGTATCTGAGACAGGGGTCAGTCAA 
TTCAGAAGCTTATTTTGCCAAACTTATGGACCATAACCCATGAC 
ACAGCCTCAAGAGGTCCTGAGAACATGTGCCCGAGGTGGCTGGG 
TTACATCTTGGTTTTACATGTTTGAGGGAGACTGAAGACATCAG 
TCAATACATGTGAGGCATACATTGGTTGGGTCCAGAAAGGCGGG 
ACAACTTCAGAGGTGGGGAGTGGCTTTTAGGTCATGGGTGGATT 
CAAAGATTTTCTGGTTGGCAATTGG 


- 


KCP_rs95 
2767 


AAAAGTAGC ATCGAGAATC AATTTGCAT C TCAGAATTGGG ATC C 
CTGCCCTAATCTCTCTACTTTATGCGGCCGTGTCCTGCTTTTCA 
TGACTCTAGAAAGCAGAGGAGAAAGTGGATGTAAGATATAAATT 
AGTCTGTCTTGTAGGGCTTTCTCTTGGTCCCATTCTGGGACCAG 
C CAGTGTCC ATACCTGTGGCCTTTGGT AT CCAATTTAAGGCAGT 
TCTTCTCTTTCCATGATCACACAGTAAAGGAGCCCCCGTATACA 
GTGCTCCAGGACTGAGTCCAGTTTTTAGTGTAGCGTGCAACAAG 
AGCAGAAAAGGCAGAGTTGGGAAGGACATGTCAACGGGCAGCAA 
TGAGGTGGTATAAAGACCCTGGGCATTTGGAGGCAACAGAGGGA 
GAAAGGTCTGCTTCAAGGACCAACTTGGTCTCTTCCTATCTCTG 
CCCTGGCAGCACCAGCAGCTGCACATTGGCCCTTCTTACCACTT 
CCATGGCAAAACCAAG [G/T] TTTCTCTACCTCGCCTAGCCGGC 
CCCTGCAGACTTGCTGACACAGCTGAGTGCGGAGTGCATCTAGA 
CCCCAACATGAGGCGCCCTTCTCTCAAAACAAATGAGCCTTCGA 
AACTCCAGCAAACAGTGCTAATGAATTGCCCTCGGCTTCTTAGG 
CATCATTTTCTCGTAATTATAATGGGAAGAAGACATGGAGTCCC 
ACTGAGAACGTGGAGCTAGCCTGCCCCTAGAGCAAGGCAAAATC 
CCTCTCTGAGGACCACACTCAAGCAGAACTGATTTTTCTAAGAC 
TTAGAGAAGAAACAAAATCTGATTTAATTCTTAGGAAATTGCTT 
TTTTTAACCCAC CTGTGTAAG CCTGTATTTAAATGCTAATATAT 


SEQID 
NO. 118 



JNSDOCID: <WO 2004041 193 A2J_> 



WO 2004/041193 



-123- 



PCT/US2003/034681 





TTGGCCTGCCGGGATGCCACATTTATTTTCTTCCTTAGCAGCAA 
CAAAAATCATTTATTTATGAGAATTCTAGCTCCTACCTGCTCTC 
CTGAGTTCCTCATCTTCATTTCCATCTACCAGCTGGA 




KNB_2422 
2 


GAGGGGTTTTTAAGATTGTGTGTTCTGAATGGCCTGTCTCTGAC 
TGGAACCCAACTCCGTCCCCAGACCCACTTCCATCTTTTTCTGT 
GAGGGGGACACT^CTCTTTCT^CTTTTCCAAAATGGCATCTACCA 
TGGCTTTTCTGATTAAAAGCAAACGAAACACACCCTTCCTATAA 
TCAAAAATTTAGAAAAGCAGCAAAAATAAAAAGGGGATAAGGAA 
GAAAACAG AAATTAAC CACCATCCC [A/G] CCGCTAAAATTTTG 
ATGAGTTCTCATGTGTTTCCTTNCAGCTCjATTGTTGTTTGGCAT 
ACATTTATTAA 


SEQID 
NO. 119 


KNB_2425 
9 


CACTTCCATCTTTTTCTGTGAGGGGGACACACTCTTTCAACTTT 
TCCAAAATGGCATCTACCATGGCTTTTCTGATTAAAAGCAAACG 
AAACACACCCTTCCTATAATCAAAAATTTAGAAAAGC AG CAAAA 
ATAAAAAGGGGATAAGGAAGAAAACAGAAATTAACCACCATCCC 
NCCGCTAAAATTTTGATGAGTTCTCATGTGTTTCCTT [A/G] CA 
GCTGATTGTTGTTTGGCATACATTTATTAATATTGGAATTAAAA 
ATATATATGGCACTTTATATCCTAGAAAATAGTAATACTGTAAA 
TGTGTTCTAGAAATGGGAGCTGCTGTTGCTCTTATTAGAGAATT 
CAAACAAAGAAGGGAGGCTCGCTGGGGACAGCTTCTGGGGGAGG 
ATGGGTACCGCTTTGAGACA 


SEQID 
NO. 120 


KNB_24 72 
8 


AGGTATGAGTCAGTTGAGTGGGGACAGGTAATAGAGAGCTAGAA 
CTGGCTGGCCTTATGGCCTCCAAGGCATTGGGGAGCCACTGTAC 
ATTCTTGAGGA.GGCAATGACTTCACAAAAGGATTTCTCAAAGGT 
TAGTCCTGCAACAGAAGACAGCGTGGATTGGACTGGAAGAGTGG 
GAGGGCAGGTGGAGAAGGCATTG [G/T] CTGCAAGTGGGGAGCA 
GCCCTGGGGGCCCAGCCAGTCCCCTGTGCCCTGACAAGTGGTAT 
GGCATGGATGGATGGCTCTACTTCTGGGCCGCCAGGATGGACAG 
GTACTGGTTGCTCTTCACCATGGCGATAATGAGGAGGCCACCGG 
TCAGCAGGAAGGTGGGCCAGAAGAGGGAGAAGAGGAGGGCCTGG 
GGCCCGTAGAGGCGCTGGAAT 


SEQID 
NO 171 


KNB 2989 
5 


CCCCATCCCTCCAGTTCAGTACCTGCTGGTTCTGGTCCCGAGTG 
TCCTCCGTGTGGTACAGCACAGCCCACCTGCCGGCAGCTGACAC 
GTTGACCCACAGGCATGGGTACTGGGGCACCTTCTTGCCCTTCA 
GCT [C/T] CTCCTGGTCCCTGATGTTGGTCTCAATCAGGTGGCA 
CTTGGATTCCTGGGTCCACACGCTGAGGAGACCACACACATGCA 
CACATACACATCTCAGAACTGGGTGACACACAGAACACCCATTT 
GAACCCATTATCCCCTGGGAGCCTCTAGAGGGATCCAGGACTGG 
GCTCGTCATCTTGTCTTCAGCATCCAGCAATAAAGGCACAT 


SEQID 
NO. 122 


KNB_314 9 
7 


TTCCCTGCACTTGAACCCTAGAACCTAAGAATGAGCATCGTCTT 
GACCCTGCTGCCTTG AATGAGGGT CAAGGAGAGGGG TGAGTAGA 
AGGCCAGGGTTCCTTACAGATGCCAGACCCTTAGGAGAGGGTTG 
GGGGGTGGGCAGGCCNGGAGAGCTCAGTACCTTTTCTGGTAGAG 
GGGCAGCACAGTCGTGACCAGGATGTAGTAGGTGATGACGGCAC 
ACACCACCATGGTTACACCCAG [A/G] CAAAGGGCTCGTGTCTC 
TCCCCGCTTCTGGGCCATCACCAGCTTCTTCACCATATTCACTG 
GGGGCAGTGATCATTTCTAGGTCCACAGAAGCAAACAGAAGTGA 
GATCAGCCCAGTTCACAGGTGATCCACAGAAAGAGAGGACAGGT 
GAGAGGGGAAGGTACTCAACTATTAATATCACTCTTGTTTATAT 
TTGGAGCTTTGCAACTTCCAGAAGTCTTGCTTTTTGGACCCCAT 
GTA 


SEQID 
NO. 123 


KNB 3529 
8 


AGAGGAAGGGAGTCCTCCTGCCTGCCTCCCTCCCTGCCCCGTGG 
CAGGCTGCTTTCCCC [A/T] GTCTCCCTCCAGCCCGGTCTTCAG 
AGAAATCACTTCCCAAGTGCTTTCAGGCCCGGTACTCACAGTCT 
TCCCGGCGTCCTGTGGGTCTTGAGCAGCAGACAGTTTCTTTCTG 
CCTGGACCC 


SEQID 
NO. 124 


KNB 3537 


AGCCCGGTCTTCAGAGAAATCACTTCCCAAGTGCTTTCAGGCCC 


SEQID 



f 

tNSOOCID: <WO 2004041 193 A2_t_> 



WO 2004/041193 



-124- 



PCT/US2003/034681 



0 


GGTACTCACAGTCTTCC [C/G] GGCGTCCTGTGGGTCTTGAGCA 
GCAGACAGTTTCTTTCTGCCTGGACCCCCGCCCCCACCCCAAAA 
GAGGCCACAGAGCTTCA 


NO. 125 


KNB 3539 
9 


AGCCCGGTCTTCAGAGAAATCACTTCCCAAGTGCTTTCAGGCCC 
GGTACTCACAGTCTTCCCGGCGTCCTGTGGGTCTTGAGCAGCAG 
AC [A/G] GTTTCTTTCTGCCTGGACCCCCGCCCCCACCCCAAAA 
GAGGCCACAGAGCTTCA 


SEQID 
NO. 126 


KCP_rs31 
4129 


CTTATCTCCACCCTTCACTTGACCCAAGAATCAAAGAACCTGAA 

ACTGAGACTTGGAGGCTTGAAGTCACTGGTGCAACCGTAGGGGC 

CAGAACTAGATTCGAAGCTGGCCCTTCCAGATGGCACAGCTTGG 

TCTGTCTCTGATGACCCTGGGGCTGCTCTGAGACATTAAAAATC 

ACCTCGATCATACAGTAAGCTGCCACCTGAGGCTCTGGAGGTCA 

CCCTGAGTTTCCCCAGCCCCCAGGGAGGTGGGTGCAGCCTGGCC 

TTCCCTGCTGAGCGAGCTCACCACCTTCCTCCCTCCTGCCTCCA 

GCAGGCGCGAAATGAAGGCAGCCACTCAGGCCTCCCTGACACAC 

TCTCAGGCGGTGAGTGCCCTTCTCCACCCCTTTCCTAATTGAAT 

CTTATTAACAGGAGACTACAGTGTCTGTTTAATGGGCACCATAG 

CACCAGAGGGTCTAAGACCAGCTTCAGACCTTGCAGGCAGATTG 

ACAGAGGGATGTAGGA [C/T] CTGGAATTCAATCTCAGAAGAGC 

AATTTTCCAAGGATGATCCTCTGTCCACTCAGAAGCAGGAAAAG 

TCCTCCTGGGGCTAATCCAGAAATGCCAGGCCCCCCTCCTGCTT 

CCCTGGGGGAGAGATACACAGTGCAACAGGCTGCCATTTATGAG 

TATAACCGAAGGGCTCCTTGCTCGTGATACTCTGAATAAGTTAT 

TAAGGGCTACATATTATTTGGAAATCATAAACAAACTTTAGCAT 

TCTTCCCAAGGGAAGGTGGGAACAAACAGGGAAGGGGGGCCGTG 

GGGTCTTCTGCTCCCCCTAAATGAGCCACAACCAAAAGGCATTG 

ACAAGCCCTGTCCTCGAGGGTTTGTGGGTGAAAACCCAGGTCCT 

TTGCTGGCTGCGGGGTTGTGTGTGACAGATGGCTACAGGTGGAG 

GGCAAGAAAATAACAATGCTGCAACAATAAATATTGACGGTTTG 

CATTAGTACGGGGTGTCAGAGATCACAAAATATCTTC 


SEQID 
NO. 127 


KCP_rsl8 
3398 


TACTTTTCAGCCTGAGGTCTCCTCCTCACCACTAACACCCTTCC 

CTCCAATCAAACTGATCCATTGTACCTACAAAAAGCCCGTCCCA 

CCTCCTAGCCTTTGTTCACACTGGGTTCTCTGCTGGATCACCAT 

CCCTCCACATTTCCAGG^GTCCCTCAAGACTACTCAGCAGCAGC . 

TATCCATACAAGTTCCTCAACCCTGGCTTTCTTGCCCTCAAGTA 

ACCAGTTCATCCTCCCCAGTCATATAGCCCTCTATTTACATTTC 

TTTTCTGGAAGCTATCATTTTTCACGTGCCATTTGAGTGAGTGT 

CCTCGCTAAGACGATATTTTCTTTGAGGGCAGTAACCTTTCTTA 

TATGTCTCTGTATC C CATGAACTTAGCAAAAAACAAGGGACAGA 

ACAGGTGCAAAGTCTACGTGGTTAGTGAATTTAACAGATCTTCC 

TAACGTGTAAACGTCGTTGTCCAGGTGAATGGAAGAAGTGAGCT 

GAGATAGAGGGGACAGACAGAGTCAGTGTCCAGTGCTGACCTCT 

GAAATGGAAAAACATGGCCAGTCCTTAGGAGGCTGCAGAGGCCA 

AGACCCCAGTGAGGTTTGGGGGTTCCACAGCAGAGGAGGAGCTG 

TGGACCACAGCAGGACCCCGATGCCATCAGCAGGGGAGGAAGTA 

ATCAGAGAGGTGGAGGAAGGAAGCCAAGGGAAGTCAAGTAAACA 

CCAAATATTCCCTCCCGGTCCAATGCTGTGACCTGCATAAGCCA 

CCACTCCCCCAGTCTAGACTCTACCCATGGAAGAAGGAAGAAGA 

TAGAACTCTGGATTTGAATATAATTCTAAAATAACCAAATTTAT 

CTGAAAATGACTAGGCTGAGTTTTCTGCTTCAACCAGAAATGGA 

GCTTGGAGTCAGAAATTATGTGAAATTATAGAAGAGAAAGTCAC 

CATCTTCCATCTCTGAGTCGTATGATCATTTTAGACATAAAATT 

GTGCACTTACGATGTACCAAGTGCTTAATATA [C/T] GTGATCT 

CATTTC AC CAGG G AAACTGTATAATTC ATTG CTTTAACTGAC AA 

AATTCTGCAACTGAAGAAGGTGCTGTTAATAATTGCATTGGGAC 

GCAGGCCTGAGCAGGCCATGATTTGTGGCTGTCCTACATCTGAC 

CCTCACAGTATCCATGGGAGAAGGCAGCATGTTTATGCCCCCTG 


SEQID 
NO. 128 



MSDOCID: <WO 2004041 1 93A2_L> 



WO 2004/041193 



-125- 



PCT/US2003/034681 





ACAGCTGGGGAAACCAACACTTAAAGTGATTAAGTCACAAGTCC 

AAAATAAATGAC AGAG CTGCAGTTCAAGC CCAGGTGGTC ATTTA 

CCAAAGGCCATGCTCTTTTCACTTTGCATGGGACTGTGACCGCT 

GGCTCTACCCAGCTTCCCAGTGCGACCCTTCCCCGCCCACTGTT 

TCTCTTCTCTGGCCAACGGAAACACAATGAGACCACATATGTAA 

CATTACATTTTTTCATAGCCACATTGAAAAGAAAAAGGAACCAG 

GTAAAATCCATTTTAATATGATATTTTATTTAACCCAATACAGT 

TGAGGCTTGAACAACACAGGTTTGAACTGTGTGGGTCCGCTTAC 

ACATGGCTTTTGTTCAGTCTCTGCCACCCCTGAGACAGCAGGGC 

CAGCCCCTCCTCTTCCGCCTCCTCCTCAGCCCACTCTACATGAA 

AACAAAGAGGATGATGATCTTTTTGATGATCCACTTTCACTTAA 

TAAATAGCAAATATATGTTCTCTTCTTTATGATTTCTCGTAATA 

ACATTTTCTTTTCTCTAGCCTCATTTACTGTAAGAATACAGCAT 

ATTCCCAGCTACTCAGGAAGCTGAGGCAGGAGAATCACTTGAAC 

CTGGGAGG C CGAGTTTGCAGTGAGCCAAAATCGCACCATTGCAC 

TCCAGCCTGGGCAACAAGAGCGAAACTCCAACTCAAAAAAAAAA 

AAAATAAAAAGAATACAGCGTATAATGCATGTAACATATAAAAT 

ATGTGTTAATCAACTGTCTATGTTATGGGTAAGTCTTCCAGTCA 

ACAGCAGGCTATTAGGAGTTAAGTT 




rsl03285 
6 

- 


CGCTCAGCAGCCATTAAAAGGATATCATCCAGTCACTTAGTTTC 

TCAATTTAACTTTAAAGGAAAGTTGCCTTATTAGAGAAGTGGCC 

TCTATTTCAATGTAATGGTCTTTGTCACATCTTCCAATGTGCTG 

GCTTAGTGCTGAAGGATGGGGAAAGGCAGTTTTCACATATTGCA 

GCCACCATACCACCAAAGAAAACAGGTGCACTTCCAGGCATCAT 

TTAGCGGGGTACCA [C/G] ATTCCTGGTTCCAGTTTCCTTTTTA 

GAAAATCTGAAAGTAACTTTGGGGCATATCTTTTAAGGAGTACT 

CCAACACGACTAGTGGACAGACCCTAAATTAATTGCCAATCAGC 

TCTGCCTTCTGGTATTTACACCTTTATGTAATAACCTCCACTTG 

AAGGTAGATGAGATCTGTGACTTGCTTCTAACCAGTGGAATATG 

GCGGAGGTGGTGGGACGTTACTCCTGTGATTACATTACATCATG 

TGGCTCCTTTATGATGGAAGATTCATGCTAGAGATTCTCCTTGC 

TGACTTGACAAAGTATGTAACCATGATGAAGACTTCCACGTGGC 

AAGGAGCTGTGGGAAGCCCAGGTGCTGAGACTGGCATCCAGCAA 

ACACCCAGCAAGAAACAGACGTCCTTGGTTCTACACATACAGGA 

AATGAATTCTG C CAACATCCTGAGTAAGGCTGGAACTAGATTCT 

CCCCAAGTTGAGCCTGACAAGTAAAATACAGACCAGCCAACACC 

TTGATTG C AGT CTTGTGAGACCTGGGGAAAAGGACACAGCTGAA 

CCGTGTCCATTCTTCTGACCCACAGAAACTGTCACATCATAAAG 

GTATGTTAGTTGTTACACAGTTTAGAAAACTATTACAGCTGCTC 

AAGAAGGTTAGCTAGCTCCAGATTTCAATCCATTCACAGGAAAG 

CAAGCTTTATTCCTAGAAGAATAATTCATGCTTTGCAAAAAGAG 

GAAAACGTCCTGCAGTTTTAGAAGGTCTTTTCTTTCTCAACACA 

CCCAAATTTCTTTAAAATCCTCAAGAAGTGCATTTGTTTTCATG 

GTTGACTCGAAGAAGTGAGTATAATTAACTCACAAAAGGTGGGA 

GGAAGGGACAAATTAAATTTTGGT 


SEQID 
NO. 129 


sKCP_rs8 8 
8934 


CACTCAAAGGGCTGGGGACCCTTGTCCCTCCCATGTGCATCCAT 
CTCTCCTATCTCTGAGTCCCCAGTGAACTGCTGCCTCCCTAGAG 
AAACAGTGCTAGAAGTCAGTGGCAAGAGCAGCAGGAGGACTTGG 
x ± vj ibi GAGCTCCGGAGTCAGACCAGCTGAGTT 
CAAGGCCAGCTCCACCATCTATTCACTGTGACTTCAGGAAGGTT 
GCTTAACCTCTCTGTGCCTTAGCTGCCTCATCTATAAAACAGGA 
AACAATGAGAGTCTTTCCTTATGGGGCTATTGAAATGATTAAGT 
GAGATCAGGCATGTGATGGCACACAGTAAGAACTCCATAAACAG 
AGGTCACCACTG CTAATGCAATTATTCTATCAC CTC AGGAGACT 
AAAGCAGGGGAGGAAACACCATTGACTCCTGGACATTTACCCAA 
GGAGATTATGGAT C CATGTTTTG CACACACTTTAGAAAGACAAG 


SEQID 
NO. 130 



NSDOCID: <WO 2004041 193 A2J_> 



WO 2004/041193 



-126- 



PCTYUS2003/034681 





GAATTCTAACCACAGC [A/G] TCTGTCTCCACTGCCCCCGTCAT 
TTCAGTCTCACCCGTCCACCCTCAACCTCACCACTGTGGCCCGG 
AAATGCGGTTGCCCAGGGCCACTCTCACCCCACCTCAGCCCTGC 
TCTGCTCAAGTCTCACTTCCACTCCTTCCAGCTCCCATCCCTTT 
CTACCCAGCTCCACCCTGATTTCTCCACCATGACCTTTACCCTC 
CTAGTCTGATCTAGACCCCTGATCTTGCCGAGTATCTAGGACTT 
TGGTGCCTTTGACCCTCAGCAGCAGAGGTAGAGAGGGATCTCGG 
TGAAGTCTGGGATGTTATAGTGACTTGTTTATCTAAGTGCCCTG 
AGACTGTGAGTTCCCTAATGCAGGGAGCATCAACCTCTGCAGAG 
AGCCCCAGAGCCCTGCTCAGGTGTGATGAACAGGAGGCACTCAC 
TTGATGCCCTCACAAAGTTGTGAGTGAATGAATGAATGAGTGAA 
TGAATGATTGAATGAAGATTAGTGATTATGTTAATGA 




rs905823 


GGTGGGGGGGGAGAGGGGAGGGGAGGAGAGGGGAGGGGAGTGGG 
GGAGAAGGGGAGAAAAGCGCAGCTGGCTTCCTCACTCTCCTTTC 
CTTCCTCACCATCCTTACCCTGGGCCAGGGCAGGAGGAGGATTG 
GCAGAGTAGAGGCAGGGTCTTCTGTCTTAGCTGGGCCTGTTGGT 
GACTTTCTGTTGGCCAACATGGGCTGACTGGAATGTTCTCCAGC 
ATGGCACATGGTCATCCAGATGCAGGCTCTTCCCTGGGGCACTA 
TAGCAGAGAGGGCTCTCTTCCAGTCTATTGCAGATGGATGCCCT 
CGTGAGCTGAGTTTTGATGAACATCCCATGTCCCCAGCCACCCC 
ATTCAGAGCCTCTTTCTACTCTGGTCCTCTGGTCCCAGCAGCAG 
CCCTCTGGGTACTGAGGGGAGGGCATCTCACCCAAGCCCCTTAA 
ACCTGCTCACCTTCTTCAGAGCCCACGTGGCCGCAGGAAAGTCA 
CAAACCCTTGTGCTCCCACAGGGCACACGTGTGCACACGTGTGC 
AGCTACCTTCTCTCTAGTTGGTACCTGAGGCTGCCTCCTGGATT 
TTCCAGTCTCTGTGTTCCCAGACA [A/C] CCCCAAGCCCCAAGA 
ATACAAGAGCTCTGTCACCAAGCATCGGGCCTGTGGCTGCACTA 
CACGTCTGCAGCTCAGGACCCCTGGCTGCGGCGTAAGCTACCAG 
CATCCCCTTCTCATGGGCACCCTCATCTCCGGCTCCCCATCGCT 
GGGCTGTGACCTGCGGGGGCGCCCCTCTATGGAAGGGAAGGAGA 
AAAATTCACAGTGCTATCTACTCCTCTGAATGCACTCCCACCAA 
TTTCCTTGGAAATTTCTAGCTTTCACTGACATATCTGGGATGGG 
GGGGTGGTCACAAAATCA 


SEQID 
NO. 131 


rs883849 


CTGGCTGGGGGACCATGGGTCAGGGCTGCCACCCCCTGGCTCTG 
TGCCTTCACCTGTGTAACGAATGGGGCACTCACAGCCCCTCTCA 
AGTGGTCCTGGGGATGAAGTGAGAAGGTGACATATACAAGTGAG 
TTATACACGTTCCTGTTCTGTCACTCACCAGTGCTCACTGGGTG 
GGTCACTGAACTCCCCTCAGCGTTTCCTTCTCCATCTGTAAACC 
ACCAGTGCAAACCTTTCCCAGATAGTGCTGACCCGAAGCAGGAA 
CCAGTGCCCCTCTGCCCTCAGTAAGTCTGCCAGCAG [A/G] GGA 
AGCCCATAGAGGGTCTTGGGAAATGAAGCCAACAGAGTCAAGAG 
GGTCAGATGATGAGGGACTTCAAGTGCCACCTTCATCCCATTCT 
TTCTGCAAATATTCACCACACACCTACGTGACCTCAGGCTCTGT 
GTCAGGTCCTGGGGATGTAATGGTGTCCATGAAGAAACAAGGTC 
CCTGCCCTCATAGAGTGGCCTGACATATGCCCGAGGCAGTCAGC 
AGCCGAGTGCGGGAGACTCTTGAGCAGAGATTGAGTGTGTTGAT 
ATCTGTAGGCATCAGCCTGGCTTTGCTGAGTGAGCTATATCAGA 
GTGGAGGAAGCCAGAGGCAAAGTCCAGACTCCACTGATCCTGGA 
TTGAGGGGAGAAGGGGCTTGGCGGAAGAGCAGCCTGAGCACCTG 
CATCTCACTCCAACTGGTGCTGATTTGTTCCCAT 


SEQID 
NO. 132 


rs213504 

6 . 


TCCACAGGTTTGATTATAAATGTGTGTATTGAATTGGAATTTCT 
GTTGAAATTCTGATCCCTTCTAGACAAAGAAGGTAAAAATTGAA 
ACATGTCAATGGATATCTAAATATCATTACTCACTGGCTTTATT 
TGCAAATGGCTTTCCATTGACAACAGTTACATTTTGTTCAAAGC 
AACAAATGATTGGCGCTGACAATCCACAGGAACATGGTGCAGTC 
ATTAATGAATGTGCTCATTATTCCTCCCTGCCGGGAGGCATCGA 
CTCCCGTTCTCCAGCCTGTTTTAAGCAGACAGACCTACATCTGC 


SEQID 
NO. 133 



338SDOCID: <WO 2004041 1 93A2J_> 



WO 2004/041193 



-127- 



PCT/US2003/034681 





ACCTGTCAGCTTGGAACCCTAGTAGGGGAGGGGGATGCTGATGT 
GATGGAGAATGAAGAATGGGCCCTGCAGGCTGACATTTTGGGAG 
AGTAGGTTCTGAAATTTATCCCAAAGGACATGGAATCCTGGAAG 
CAGGGTTCAAGATCCTCCCAAAATTGATCTCCCAGGATGCTTGG 
AATGATTGTTC [C/T] GAGGGTTTTGTAAAATGCCAGGGGAAAA 
CCAGGAAGCTTCTCTCCAGTTGTCTTGCCTCCTTCCTCTCCAGT 
CTCCATGGAGCTGACTTTGAGAATTAACTCCTGAGGGACAGAGA 
CCCTGGGATGGAGAGCCAGCCCTGCTGGATTCCACAAGGTGCTG 
CTTAAAGCACAACACCTCTTCCCAATGACAGGTTCTGAAAGAAG 
GCCTTGTAGCTAGATGCACAGAGGGTTTTGTTTTGTTTTTTTTT 
TTTTAACCTTTCAGCATCTGTCTAAAATTGCTCTGGGCTGGGTA 
CAGTGGCTCCCACCTGTAATCCCAACACTTTGAGAGCTGAGGCA 
GGAGGATCGCTTGAGCCCAGGCGTTCTAGACCAGCCTGGGCAAT 
ATAGTGAGATCTCTATGTCTAG 




rs50057 


GGATCTGTGCCTGAAGCTGAGCTGCTGCAATGAAACTGACATTT 
CTGCCTTGCAGCCTGGCCATGGGCTTAGCTGGACTAAAATGCTG 
CTGCAGTGGTGAGGGCACGTGAGAGTCCCTAATGTACATGGCCT 
TGCTCCTTGTCCTGACACATCTTTTAGGGCTGCTGCTTTCTCTA 
GTGCTGGAATCTAGATAATTCCTTTCCCAGCCGTTTGTTTCTTC 
AATCTTGGAAAATATCTGGATGAATGTAACACTGTCACACAC [A 
/G ] AACAGAATTATGACTTACGTCACATTCTATGTCGTGATTTT 
GTGGACTTTTAATAATTGCATTACATTTGTGACCATTAATTTCC 
ACCATCGCCCTGCTCCTGAGAATCTGTAAGGGACATTTGACACT 
CCTCTCCCCACCCACCTCAACATTTGTGCTGACCTGAAGGTCAC 
ATTAAAAA 


SEQID 
NO. 134 


KCP_497 6 


GCAACTTTCCCTGGCCTGCAGGAACTCAGGGACTCAGGGGACTA 
ATAACAACAGTGTATGAGCTTCCGGGCACACTGCTTCCCAGTGG 
CAGCCCCTGTACTTAGGGCTTTGTATGTATTAATTCATTTACTC 
CAATTCCCACAATAACCCTATAGGGTAGGGTTTTATTATTGATT 
ACCTTTTTACAGAAGAGGAGAGTAAGGCAAAGAGAGATAGAGTA 
GTTTTCCCAAGGTCAAAGAGCACATAAATGATAAAGGATGGATT 
TGAATGTAGGCAGAATGACCCTCAATACAGACTGTTCCTACAGT 
CCACGTCCTCAGCCACTAGACCATACGGCCACTGGGATGATAGA 
CAGACCACTGCAGCCATGGATAAGGCAAAAACAGGGCTGGCTGT 
GTTGATCTGTGTCTCTCAGAGCTCCATTCTTCCTCAAGGGGGCA 
CCTTGCAAAA7VAAAACAAAAAAATGGGGCAGGGTAGGGAACTGA 
AGGCAGGAGCTCTTCA [C/T] AGAGCATAGCCACATCCTCCAGG 
CAGACAAGAGGACGCAGGAGGCACCATTCTGTGAGAGTATCACA 
GTCTGACCCAAAGACACAGCTTCACACTGTCTGATGGCTTGATG 
GTTAATGTCACTCTGCCTTTTCCCCTTCTCAGGACTTTGTAACC 
GCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAGAAACTAAG 
GTGGACATTTAATTTGTATGACATCAACAAGGACGGATACATAA 
ACAAAGAGGTAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGCTCC 
AGTGAAGGTAACTAACCCAACAGAAAACAGCCCCAGGCATGAGG 
ATAGCACTGTCTGAATGAGGCAGGCTCTGCTTTGGGGCTAACAG 
AGCTGGTCCCTGGCAAAATAAAGAAGGCCTCCCTCATTGCCCTA 
CCCTGCCCTGTTCCCAAGCGCCCAGAAAGGATTAAACAGATTCA 
TTCTCACTGGGTCAC CTAG ATTCAGTAGATATTAC AC 


SEQID 
NO. 135 


KCP_5077 


TAGGGCTTTGTATGTATTAATTCATTTACTCCAATTCCCACAAT 
AACCCTATAGGGTAGGGTTTTATTATTGATTACCTTTTTACAGA 
AGAGGAGAGTAAGGCAAAGAGAGATAGAGTAGTTTTCCCAAGGT 
C AAAG AG CACAT AAATG ATAAAGG ATGG ATTTG AATGT AGG CAG 
AATGACCCTCAATACAGACTGTTCCTACAGTCCACGTCCTCAGC 
CACTAGACCATACGGCCACTGGGATGATAGACAGACCACTGCAG 
CCATGGATAAGGCAAAAACAGGGCTGGCTGTGTTGATCTGTGTC 
TCTCAGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCAAAAAAA 
AACAAAAAAATGGGGCAGGGTAGGGAACTGAAGGCAGGAGCTCT 


SEQ ID 
NO. 136 



NSDOCID: <WO 2004041 193A2J_> 



WO 2004/041193 



-128- 



PCT/US2003/034681 





TCACAGAGCATAGCCACATCCTCCAGGCAGACAAGAGGACGCAG 
GAGGCACCATTCTGTGAGAGTATCACAGTCTGACCCAAAGACAC 
AGCTTCACACTGTCTG [A/T] TGGCTTGATGGTTAATGTCACTC 
TGCCTTTTCCCCTTCTCAGGACTTTGTAACCGCTCTGTCGATTT 
TATTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTTAAT 
TTGTATGACATCAACAAGGACGGATACATAAACAAAGAGGTAAG 
TGAGCTGGGGCCAGGGGTGTGAGAGGGCTCCAGTGAAGGTAACT 
AACCCAACAGAAAACAGCCCCAGGCATGAGGATAGCACTGTCTG 
AATGAGGCAGGCTCTGCTTTGGGGCTAACAGAGCTGGTCCCTGG 
CAAAATAAAGAAGGCCTCCCTCATTGCCCTACCCTGCCCTGTTC 
CCAAGCGCCCAGAAAGGATTAAACAGATTCATTCTCACTGGGTC 
ACCTAGATTCAGTAGATATTACACAGTGGATAAAAATGACTTGT 
TTCAGTGTGAAGAGTTACTCTTCCCTAGGGAACCTGCATTTGGG 
AAGGTTAGGAGCCACAAGTCAAAGCTAAAAGTTGAAA 




KCP_2410 
99 


TTGAAAGAGAGCGCTTTGGGGGGTTTTCTTACTGTATGTCTCTA 
TTGCATGTTCTGTATTTTACATTTTTCTATTATTTCTTCTCTGA 
GGTATAGTATTGAATGTAGAAAAATCCTCAAATGTTCGGTATTA 
AGCAATACACTTCTAATTCATGGTTCAGAGAAGAAAATATCTCG 
AATAAAAATAAAATAAAAATATGACTTATCAAAATTTGTAGGAT 
c lAHrtuLAoiAi iLUftuuAAioLftAurljl J_ bu 1 X I AACATTCAAT 
AATTGGTCAGTGTAATTAATCACATTAATAGAATAAAAAGAGAA 
AAAATATAATCATTTCAGTGGATGTAATTGTTCAGAGCTTCTTA 
AAAGAAGCAACTCACTATTTTACTAGATGATTTGTTTCTTCTGA 
ATTCCTCTTTAAGGCTACAGGTGGTGCTTCTTACTTTGAACTGA 
TCACTTTCTAGGTCCCCACCCTTACTTCTTGTTTTTCATACCCT 
TGTAGAGTTTTCTCCA [C/T] ATAGGAAACCCATGCTTGACATT 
TG CTCAC CAGAGTTACAGAGCTCTCAGGGAGGAGACTCAGAGTT 


SEQID 
NO. 137 




AGGTCTCTCCAGCTGTTTCAAAATGTCATGTAACTGGTGACACT 
CAGCCATTCAGCTCTCAGAGACATTGTACTAAACAACCACCTTA 
ACACCCTGATCTGCCCTTGTTCTGATTTTACACACCAACTCTTG 
GGACAGAAACACCTTTTACACTTTGGAAGAATTCTCTGCTGAAG 
ACTTTCTATGGAACCCAGCATCATGTGGCTCAGTCTCTGATTGC 
CAACTCTTCCTCTTTCTTCTTCTTGAGAGAGACAAGATGAAATT 
TGAGTTTGTTTTGGAAGCATGCTCATCTCCTCACACTGCTGCCC 
TATGG AAGGTC CCTCTGCTTAAGCTTAAACAGTAGTG CACAAAA 
TATGCTGCTTACGTGCCCCCAGCCCACTGCCTCCAAG 




rsl89530 
1 


TTTTTTTTCCCCAATCATGCTGTATTCTTAGCGTAATTTTAAAA 
TACTTAAAACAAGATCATGAGAAAATAAATGCCCAGATTCTAGC 
AC CAAAATTCAGAAGGGGGGGCTATGAGAATGAGGGGCGGGGAG 
AAGCCTTCCTGAGAGTTTCTAAGAGGCATGGAGGCAGTGGGGAT 
AGTGATTAGCTCTGGGGGAAGAAGAGGCTACTGGCTGGAAAAGG 
GCATGAGGTAGGGTTGGTAATCACCTA [C/T] TGTTTTATCTGA 
GTGCTGGTCACACAGATGTGTTCACTTTAGGAAAATGTATTGAG 
ATTACACTTGTGATTTCTGCATTTTTACATACGCACATTAACTc 
agtcatatgctgataaatgtttaacaatgggtttgctggagaaa 
aaagggtcccccggatttgtaatgtctgcccatttccgtggtgt 
aaatactcccttcacaactgatttcaagcttcccatgcactgta 
actgaagacagagttgggaagatacgtgcagtagcacaacatta 
aatcatatttccaccatatacacacaataggtgtaaataacacc 
cagagcatagaaaa 


SEQED 
NO. 138 


rsl42275 
2 


GGTTGGCAGCTTTTAATAACTTAGAAATGGCTGGGGGTGGGGGG 
GAGGAAGTACTGAATCATTTACTCATTCAGCAAATAACCAGGGA 
ATACCTACTCTACACTGGTCACTGATGGAGATA [C/T] AGACTT 
GGGCAAAAGCCGCGTCATCTGGTTGTGTTCAAGCTGAACATTCC 
CTTGACCCAGTCACTGATGGAGATATAGACAGGCAAAAGCCACG 
TCATCTGGCTGTGTTCAAGCTGAACAGTCCCTTGACCCAGGGCC 


SEQID 
NO. 139 



tNSDOCIO: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-129- 



PCT/US2003/034681 





CATGACAGGGCAGAGGGCAtattattatccccattttacaaagg 
aaagagctgtcagacaca gTGTCACACAGGAAGGTAGACGATAA 
TGTCAATATCCCTCATCTTAGTATAAAGTTGTCCTTAAAAACTC 
TCCATTATTTATTAATTTATTGACTCACTTATTCATGTTTTCTG 
CACAGTGATACTTATCCTGCACGAGACTCTCACACCAGTGCTTT 
GGGTGTAAGAACACCCCAAGGATTGTGTTCCCTTTTCTCGAAGA 
GTCTGTGGTCTAAGGGGATTCAATGGGGTCCACTTTCCAAACCA 
AGACAGCAAAGGAACACTAGGAGAGAAGTATTCTGTGCAGAGAT 
TCAGTTAT 




rsl42275 
4 


GGATTAacaggcatgcaccaccgcacctggctaatttttgtatt 
tttagtagagatggtgtttcaccatgttggccag [A/G] atggt 
catgatctcttgaccttgggttctacccacctcagcctcccaaa 
gtgctgagattacaggtgtgagccgctgcacccggGCAACTGGT 
TTCCTTTTACTGCCACTTTCACTAACCGTGGTATTTCTCCATGG 
GCAGCATTCTTGGCATTTGGGTGTGTAGGACTGTCCCTCACATA 
GTGACCTCTTACTCATGAATTGCCAGTGTCACATTCAGATTCTT 
ATGGCAAC CAGAAGCTCC CCTGCTCC CAG CATTTCTGGACTCAG 
CCTGGGCTGGGGAGGTTAGCTCAGACCAAATATCTCCTTTCTGC 
CAGTTGCTCTGCTAGGCCCAGGTCATGCTGAGCAGAGCAAGATG 
TAGCTGAAAACCAAATAAGTCACGTGTTCCAGCTTGCTGGGGTT 
TTGTGAAGAAAGCAGCCACCCCTCCAGTCATATAGTTTGCAGGT 
TGGGATTTG CATT 


SEQID 
NO. 140 


rs205560 
6 


tggctattgtcttaagctactattaccttcttgcttgtcaagtt 
gcgcatttacttttcaaggcttgctacgtgcctggaatttctag 
attttcctttatttccatgcttggggagaggagtgcctggcagg 
ctcctaagaggggtctgtgctccatctcGCCCCCTATCTTGAAC 
TATCGGTTGGGTGCTCTAGAATCTGTATGGGGTGGAAGTGTTCA 
TTGATTTTCTGTACAAAAGCAATCAATGCTTATTGTGGAAAACC 
CAAATAAGAGAGTTG CTCTAAACAACACC CTCCCC AGTCCCAAT 
ACCTTGTCCAGAAGAAACCACTGTTTGGTGAGTATATTAGT [C/ 
T] AATGTCTGCAGACCAGATCGGATGACCAAGTTTTCCATAAAT 
GGATGGCCATCCACTTCCCTTCAAGGGCGAGGGTAGTTTGTTCT 
GATCCATCTCCCTGTTTCACAGCTCAGGGAGGGAGGAAGACCCA 
GGAAGGAGAGCTGCCACAGTTACTAGTGGCCCAGCTGGGATTTA 
AAGTCCGCCGTGACTGAAGCTTGGCTCCACATGCCAGTCTGCAA 
GGCCCTGAGTGCCCTGAGCAGTAATTCCAAGCAAAGCAGGGAAG 
CAGCGGGCCAGGTGCTGAACTGAACTGCTGCTCAGGGCTCCTG 


SEQID 
NO. 141 


rs933656 


CTGCATATGTTCCCCCAGGTATTTGCCCCCGAAGCACAGTCATC 

TCACTGCCTTGCATAGTGGAATGCTAATCAGCAGAAGACCCTTC 

TATGGGAGGCAGCTTGGAAACCTGGAGGAAGCCCTGGCTGAGGA 

GGCTAGTGGTCAGGGAGCCTATCCTGGCCAGGTCACTTTTCCCC 

ACTGGGGCCTCGGTTTCTTCTTTGTAAAGGGAGAAACTTACATT | 

AGGCATTTCCTCAGGTTCCATTTGGTTCTCAAATTCTAATATTT 

TTATGGTTGATGCTCTCACCAGAGCTGCTGCTATGATCTCAGAG 

ACGTGAGGCTCAGATCTAATTAGAAGCAACCGGAAGAGAGCAGT 

TGGGATTTTTCAactcaggaatcagtctccctgctgggttcaaa 

ttcaggctctgccacttacagctgtatgacTAAGCCTTGTTTTC 

CTCAACTATAAAACAG [A/G] GATAGTAGTAGTTACCATCTTAA 

n^ 1 Aiju ibiJ.bJ.UlX GTGTGGATTTCAAGGATCATGCAAGTCAA 

GCATTTAGCACAGTCTCTGCTACATAAGTGGTCAGCAAATTTGA 
GGTACTATTC 


SEQID 
NO. 142 


TS233909 
1 


AGACCCTTCTATGGGAGGCAGCTTGGAAACCTGGAGGAAGCCCT 
GGCTGAGGAGGCTAGTGGTCAGGGAGCCTATCCTGGCCAGGTCA 
CTTTTCCCCACTGGGGCCTCGGTTTCTTCTTTGTAAAGGGAGAA 
ACTTACATTAGGGATTTCCTCAGGTTCCATTTGGTTCTCAAATT 
CTAATATTTTTATGGTTGATGCTCTCACCAGAGCTGCTGCTATG 


SEQID 
NO. 143 



tNSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-130- 



PCI7US2003/034681 





ATCTCAGAGACGTGAGGCTCAGATCTAATTAGAAGCAACCGGAA 
GAGAGCAGTTGGGATTTTTCAa c t c agga a t cagtctccctgct 
gggttcaaattcaggctctgccacttactagctgtatgacTAAG 
CCTTGTTTTCCTCAACTATAAAACAGAGATAGTAGTAGTTACCA 
TCTTAAAATAGCTGTTGTGTTGTGTGGATTTCAAGGATCATGCA 
AGTCAAGCATTTAGCACAGTCTCTGCTACATAAGTGGTCAGCAA 
ATTT [G/T] AGGTACTATTCAATTTATGGCTCTATTGTTTGGGG 
CTTCGAAATGTCCAGAGTAAGGCCATTTTCGAAGTAGGCAGTAC 
ATCTGAGAGCCTTAAC AGCTCATTTCTGGAAAC CTTATCCAGC C 
CTATCCAGATAACTAGGACCAAAAACCCCAGCACACAGATGCTC 
GTCCCTTGLTTCAACCCTCACTGACCTCTAC lClvjltaljt.. 1"1 GTC 
CTGAAAACATCAAAGCCTGCTCAATTAAAATCCTGAATGCCTTG 
ATAATACAATTTAGAAACATACATAGTTTTTAAATAGGGCAAAA 
ACTCTGCATGATTAGTGCTGCAAGAAGATATCCAGCCCAACCTG 
GGTGTTCAGGGAGCGCTCTCTAAAGGCAACAGAAATCTAAAGTA 
ATTTAAGAGCCATGCCACTGAATAAAAATATTCAGGTTCATTTC 
CTGTCCTTCTCTCTGTTTGGGATCTTTGTGTGTCTTTAATTAAA 
AGTAGGAGAGCCCTGCTTTT 




rs!86233 
1 


ACTACTTCTAAAGCCTCTTAGACCCTGGTAATCTTCCTCCTAAC 
ACCATCGGGTGACTGCAAAGCACTGCAGGCCAGACTTCAGTTCT 
GCTGTGTAATTTGCAAGCTGGGTGACCTTCCTTATCTATAGAAT 
GGGCTCT [C/T] CTGCATGGCTGGCATGAGGAATAAACAAAATG 
GTTGTGTCCAGTGCCTGGGGCATAGCACAGCTCAAAAAACTTAG 
TTCATCCTCCTGAGGGATCAAGAAGATACTTGGAAACAAATGTC 
CAAGGGCGTAATCTTGAAGGGGCTTGTGCCAGGCATATATGGAG 
AGAAGGGTTTTGTGGGATGTCAGACTTAATAGTGCCCTTTACTC 
CCCACCCCCGTCTCTCTGTTCATAGACAGGAAATCTGTGGCCTA 
TTCTGGGACCTCAAAGTGCCACAGGGTTAAAGATACCAAGTCAG 
AAATCTAAGGTTCTAAATGGACTTTAGACCATTTTTCATTTGGG 
AAGGAAGAATTCTTTAAGGGGTTGTGCTGGCGCTGTCTCTGTAT 
GCATGTGCAGAATGTGCTTCCAGATGGGGTAATGGTCTGAGTTT 
GAGGACAGAAGTCCACTCCACTGCATTC 


SEQ ID 
NO. 144 


rs233913 
9 


GGGTGTGG C CTTTGGACAGCACCTTAGCAGGAATGTGGTGGAGA 
GCAGCC C C ATTCACTCCAG AGGAGAGCCTCAAACTCTTCAGG CA 
GATCTAGCCTAGGTAGAATCTTGGCCTGGCCCCTCCGGGATGAC 
AGGTGCCATTGCCCAAGAATGGGGAAAAGGCTGAAGTGCTCCAG 
CCAAAGACCCCAATTTATCTTCAGGACAATTTTCACTGGAAACC 
TTGCCTCACCACTGCCCACTTTTTCAGAAGTAATTAGAATGCTA 
ATCTATAAGAAAGATGACtattaaaaataaattaataataGATA 
ATACATTTTGGCTTACAATTTTGAATAATATAGCCATCCCATCT 
TAAAGTAAAAATTCATATATTTTTAATAAGCCTGAGACATGTTT 
TCCAATGAACC ACAG ATGGTT CATTTTTATTAT C CT ATAAAGAG 
ACATTATGGGCAAGTGTTTTTTAAAATGGTAAAACAGAACCTTA 
GAGCAGCTCTCTTTTG [A/G] AGATCTCTAAGCACTTTCTAAGC 
ATCAGGACCCCCTTCTGTCATCACAGAGACTGAAATGAGGAGAT 
GGTCTCTGTCACCCCCTCACTCACCAGTGAGCCCCAGACCTTCA 
TCCCTGATCAGATGGAAGCAGTGTGGCATGATTACAGTTCATAT 

1 ILAftU i. L 1 oLLAL 1 LAA I UAL i AbLLAAbUAL 1 1 b 

CAGAAAATGTAAATTTAAAAAATAATCTTCCTGAGATTGGTTAT 
GAAATGCACTCAACACAGCACCATCCACAGAGAGGTTCTTTTTA 
ATTGCTCTTTTCTTTCCTCTCGACACCCAGAATCACAAAGCATG 
CCTGAAAGC GT C AC AC ATATATGTCTGTGACCATAACATGGCAT 
TGCACATGCAAAGGAAATAAATAGGTGTTACCCATGTGACAAAG 
GTCCATGAGCTCTGTCCGCAAAAAGCTGTTGAGTTTAAAGAACA 
AATAATTCTGAAAAATCTTCCAG 


SEQ ID 
NO. 145 


rs872435 


CTGCCATTCTGATCACTGCAAGACCCCCACCCCCAATACTCCCA 
ATTGTACCACCCCACCCCACTCACCAGTGTCTCAGAAATGCCTC 


SEQ ID 
NO. 146 



3NSDOC1D: <WO 2004041 193A2_I_> 



WO 2004/041193 



-131- 



PCT/US2003/034681 





CTCCAGAAGGAAGGCATCCTGTCTAACCCACTGCTTCTAGCCAA 
GCTGTCTTTCTTCAGAAGGTAGAAAAA [G/T] ATTGTTAGTCAT 
TGTTTAATCTTTATTGAGTATATACCGCCACACCAATTGCACTG 
CCATTCATTATCTCATTTAAATCTGACAAGAGCCTTGTAAAGTA 
GGGATTATTCCCACCATTTCCCAGATGTTGAAACTGAAATTGAT 
AAACACGACATGTTGCCATGGCTACATGAAGATCTCCAAGCCGG 
AGGATCTCCACCCTCACCTGCCTAGCTTCCCAGACCTCTCTGCA 
GAAAAGGGACTGACCCCCAAGACAGCCCTGGCCTCTGGGCTCCA 
CCCCTTCCACATCCATCCCAGGGCCGCTGAGGACTGAAGAGTTC 
TCCACGTTTGCCCTTTAAAGTGACTTAAAAATAATCTTTATGAA 
TTTCTTCATATACAAAATTTGTACTTACTCATTGCAGCAAATTT 
AGAAAATACACATAAGCAAAAAAGAACGTAACAGCCATC CATAA 
CCCTAACTCTCAGAGATCACCACTATTAAAATGTTTATTATCTA 
AGAGAGAGATGATATAGACAAAGATGAGACAGATTGACACAGAC 
AAGATGGGTACATGATAGATATTTTCTGTGTTATAACCCTTGCT 
TTTTCTTGCACTTTCTAGAATTTTTCTGAGAACTAATCTGAAAT 
CTGCACAGGGTCCCCACGTTTGGATCCTCTATCCCATTGCCTTC 
CA 




rs329468. 


AGCTGAGCCCCAGGGCTCCCCCATGAGTGGGGAGGAAACTCATG 
AGTGCCTTCTATATGCCAGCGCTCTATCTGCAGGGGTTCTTTTG 
ATAGCAGCAGACTGAGAGATGATGTTACTGTCCCCTTTTTCCTG 
TTGTTGGCAACTGAGACTCAGAGGATGGAAGTGACTTGCTCAGG 
TCCACCACCTCTTCAGCTGTGGAGCTGCGACAGGAGCCTTTGTT 
TGACTTCAAAGCTCACCATCACTCCTCTCTCACTGATGCTCAAG 
TGGGCTATCACCTCGCCTTTCCTGAGCCTTCCTTCGCTATCCTA 
AAACAGCGCCTCCCGaaatcaccactaaagaacttattcatgta 
accaaacaccagcggttcccctaaaaacctatggaaataaaAAT 
TAAAAATAAAAACAGTgcctcccatgacccatgtctctccagtc 
ccataactctgctctatttccattcacagctccatccccacctt 
tatgtcttttgttcactgctttatccccagtgcctagaagagtg 
cttggcacctagtagacactcagtaagtatttgtcgaatgagtt 
aatAAGGTTGTGAAAAGAACGTTAGATTACTGGAAGGATTCATC 
TGAGTTTAATTCTGCTATGCTGGGAATCCAGTGTGCGGCCTTGG 
ATGA [A/G] GCCAGTTCCCTCCCTGGGCCCCAGTAGCCACATCT 
GTACATtTAGAGGGCAGGAGAAAAGCCACACGCTCTGTGACTTA 
TACAACTTGTTGCCCAGAGTGGAGGCTGCTTTGATGCTCAGAAA 
AAAGAAACAAACATGGAAATGCTAAATGGGTGGCAGAGAGCTTG 
AGGGAGGAAGGAGATGGGGAGGGTACTCTTGAAACTGTTTGGTG 
TCTTCCCTCCTGCCCCCTCAGTACCAA 


SEQID 
NO. 147 


rs50364 


GCCTGACAGATTTTTACTGAAGGGTGCACATTGGAATAAAAAAT 
GTGTTACCTATCTGGTTGAGTCTTCAGCTTCAGAAAGGTAATAG 
AGCAAAGGCAGATAAATCCAAACAGGGACTGAGCTGTTTTCATG 
CAGGCTGCCTTGGTAGCTCTCCAAAGCCTTCAAAAATGATGAGA 
TTTTTTTTTAAATCCTTTTTATCC [A/G] GTTGTTCTCAAGGGA 
TTCCACCCCTGCATAGGAGAGCTCACCATTCCTGGGATCTTCAG 
CTTCTATGCCTTTGCATATGCTCTTCCCTTGTTCCctcattCtt 
caacactcaactgaattatcacctcccttgaagccttctctgac 
atcccTTCTAGTCCCATGCCACCCAGGAGGCACTAAGAGCTTCC 
TCCCCTCAGCTCCCAGTTCTTAAACATGTCAACACTGTTTTGAA 
ATGATTTGCCAATGAAAAATTCTAGAC CAGCAAC CAACAAc a t C 
cttcccaaaggtgtgttatatatggtacatgctctatgtgctaa 
acaccaaattcattgataacagctaagaaccaggaaacaaacca 
tcgttaattatggcatctcttgaaaaatctaaagatctggactc 
actgggcttaaatgactgcatgataacaactggttgagtaacaa 
ctgtttccctttcatggagcagttactctccagttctcagttcc 
taccactctctatagttgtacactcatcatctgtcctcatctga 
attacctgccaatgactactggcatttgagtttctaatccatgG 


SEQID 
NO. 148 



NSDOCID: <WO 2004041 193A2_I_> 



WO 2004/041193 



-132- 



PCT/US2003/034681 





TCTATGTGTATGCCTCCTCACCAGTGTGAGAACTCATGTAAACA 
GGTATTATGTCTTTTCATCTCTCTCCTAA 




rsl55158 
3 


ATAATGGTCACGTTGGAGCAATTGCCATTTCAAATCATTAGGAA 
CACTCAGGTCACTTTGGCATGGAGCTATTTTGTAAAAGACGTAG 
AAGCCATTTATAAACTTTGGTTTGCTTTTTAAAAATTTATTTCA 
TTCTGAGGCTTATCCGTGTAAAATTACCAAAATGATTGTGGTTA 
GACTCTACATTGTCACAGTATTTAAATGTGCACAATATTCCACT 
TAGAAATAATGTCAGTACTAAAAGTAGTAGAGGGCTTTGATAGC 
AATATTAATACATCGTTAAGCCCTTCTCATTAAACAGTGTAATA 
GTCTTGTTGAAGTTTGTTAGGCATTTTAACCACTACTAATTAAA 
AATAGACCTACTGACTAGTCTGTTTTACTGTGCTTTATTGTGTC 
TTGGATGTTCATTCAGATACTTTTGCTGTTGAGAAATCAAATCG 
TCTCTTATGGTTTTAATTACAAAATACATATTAGAGGGATACAG 
TTCTTAGGGCTGTGATTTTTAATTTGTGTAACCTTTTTTTATTT 
TGGAAAGGAAATTTCAGATTTTTTCTAGTAATTTTTCATTTGTG 
AGTGTTGTTTTCTAGATACAGAAAATGTACCTAGATAGATGATC 
ACATTTTAGGATATTTTGCTTACGTGTTATTTTATATTTATATA 
CTATAATACCATTGTATAGTTCAGAACAAGAAAATATCTTGATA 
AATCATCTGCTACTGTGAGGCAGTTAAAAAAATTTGAGGCTCAC 
TGAAAATGTGTGACTTGCCCACTGTCTCATATTGCTAGTATTGG 
AGAGAAAACTAGAATCTAGGCCTTTATTTTCCTGATGTAATGAT 
TTTAGCTAATTATTATTTATTTTCTTAAATCATTGCATTAATT [ 
C/G] ATTTTTCAC AAGTAGAGCCTATATCAGTGTTTGC aataat 
taaattttaagtatatttctataattgtaaataaaatCCTGACA 
TTTGTTACAGGATGGGGTTTTCTTTCATCatatttttataataa 
aaattaaGCAGTTATAAAAATAAATAGCCTAGTTTTTCAATTGG 
TATAAGCTGGCTTTATTTTATACTGCTAATAAAGGCACATTATG 
TTCAAGCA 


SEQID 
NO. 149 


rsl457S9 
2 


CttatatattcattaattaataatttatattCACACAATGATTG 
TAGAAATGTGAGTGTTTCTTAGATTAGCAAACATCTGTGAAATC 
GTGAAGGAGTATTGAAATTTAGTAATTTGGTTTGGATCTTTGAA 
GATATTCTGTAGAATTGTTTTCCAAAAGTTACAACTGGTTTACA 
ATTTTTTTCTTAATTGCCATTAACAAGTTTTGACCCTGAGATGA 
GAAATTATTCACAAATTTCAATTAAATACTGGAATGCTTCATAT 
TTTCTGTACTTTAGGAcagggatccccaacccccaggccacagg 
ttggtactggtttgtgacctgttaggacctggactacatggcag 
gaggtgagcggtgcgtgagaa [A/G] cattactgcctgagctcc 
acctcctgtcagcgacagcattagattctcataggaggacggac 
cttattgggaacacacacaagagatctaggttgcggactcctca 
tgagactctaatgcctatgatctgaggtgggacagttttatcct 
gaagctcccccactatccgtccagngaaaaatttggtcccttgt 
gccaaaaacactggggacctctgCTT 


SEQED 
NO. 150 


KCP_103 5 
5 


AGGGCTGGGCGTCCCCCGCCCCCACCGTGCAGCCCTCGCCCCCG 
CCCCGCCCCTCCGTAGTTGCCCGCCCGCCGCCCCCTCCGCCGCC 
CCCTCCGCCGCTCCGACTCTCGCCCCGAGCGCTGGCAGCAGGCA 
GCAGGCAGCAGGCGGGCGCGCTGTGGCTCCGCGCCGCGCGGTCC 
GGGCTCTGTTCATTCATGATTGGTACTCGGCCCTCCGAGACCCA 
GCCCGAGCGCAGGGAGGGGAGCCGAGTGTGCGGCAGGAGGGGCG 
GGCGGACGGCGGCTCCCGCACCGCACGCGGCGCTGGCTCGGCAG 
CCTCGGCCGGGCGGCCGCTCTGGCCCCGTGTCCAGTGCCAGGCA 
GGCTTCAGGGCACCGTCCTCGGCCCTGGGCGAGGGAACCGCCGG 
GCCGGGTCCTCGCGCGGGGAAGCGGTTCCGAAGGCTCGCGGGGA 
GC GGCTAG CC CTGAGT CC CTGCATGTGCGGGGCTGAAGAAGG AA 
GCCAGAAGCCTCCTAGCCTCGCCTCCACGCTTGCTGAATACCAA 
GCTGCAGGCGAGCTGCCGGGCGCTTTTCTCTCCTCCAATTCAGA 
GTAGACAAACCACGGGGATTTCTTTCCAGGGTAGGGGAGGGGCC 
GGGCCCGGGGTCCCAACTCGCACTCAAGTCTTCGCTGCCATGGG 


SEQED 
NO. 151 



WO 2004/041193 



-133- 



PCT/US2003/034681 





GGCCGTCATGGGCACCTTCTCATCTCTGCAAACCAAACAAAGGC" 

GACCCTCGAAAGGTAAGCCACCTTCTTCCTTTTGTTCCCCTGTC 

TGGGCTTGGGGGTGCTAGGCGCCGAGGTGGGCTGTGCCACCTGC 

CTCCCTTAGTCCGGACTCTCCTCTCCACGAGGAGCCCGGACAGG 

TGCTTGTATCCAAAGGAGAGAGAAATCGGCGGGAGGGCTGGTGT 

GAACACCCAGAGGAGGGAGCCGGAGTGGACGTCTGCCCCAGCGG 

CAACTGGACCCCTCTGGGGCACCAGGTGTCGGGACTCTCCTCCT 

GGGGAAATCTCTGAGAGCCGAAGGAAGCGGCA [A/T] GTTCACA 

GGTGGGGGTGACCGGATTCTCTGGTGGAAGTGTGGTGAAGCTCT 

TCCCATTCCCATGACAGCTGGCGTTTGAGCACTCAGTGAGGGTG 

CTGCCACACTCCCACACTCCTCCTAGGCGGCTATGCCAGGTGCA 

GACCTGCGAGTCCCTTCATCAGGAAGAGTGCTCTGTCTGCACCC 

CCAAAACCTCTGCAAGCCAAAAGGAATCAGCTGCTGCCAGGGGT 

AAAACTCCCAGGCCTCATGTCCTGGTGGCTCCGGGAGTCAGGAG 

GAGCAACCGTGAAGGGCTGGCTGCGAGCTGAGCTTACATCAAGG 

ATTAAAAAGCATAATATCGTGGAGTCTCTTCTGCCTGGACGCTG 

TTCCTTCACCACCTGTCCCCAGCCGAGGCATGGCTGATCTCACC 

ATCCGTGGGAGAGTCCTCAAATGGGTCCAGGTGAAGTTGGAACC 

AGTGTGTTGGGCC CTGGAGGACAATGCAGGTCTC CTTACCAGCA 

GTTCAAAAGTTAGTGGTTGGAATAAAGAGACTGGAAGCAGTTAG 

GAAACGGGAAATGATGGGTTTTGTTTTGTTTAATGTTCAAATGT 

CACTACGAGTGGTAAGATTTTAAGCAGCTTGACACTTAAACATT 

C AAATTCTACCATCAGAGCC CC CATCCTGGATACAGGTGGGAGT 

TAAGCTCCTACCCTACAGGCCTGATAGTGAGTAGAAGTGTAATG 

GGGTAAGGGACCCCAAGTGAACAATAAGTCTCCTCTTAGAACTT 

GGTTGGTCTCACCCTGTTTAGAACCACAGAGATCTCCATAAGTA 

AGCTGTCCTTGAAACCCCCTGGAAGAAGGGGTCCCAGCTTCTGG 

CCCAGCTCCCAGGGGCATCAGGCTGGCTGAGCCCCGAGGAAAGA 

GATCTCTGGGTGCAGATCTTAGGTGCTGAAGCTGGGTTGGCATT 

TACATCCTAGAACATAGGAAGAGGCTTTGGCCCATTTGTCCAGC 

TGAGTTACATGTCCTGCTGGCAAGG 




KCP_1044 
6 


TGGGGGTGCTAGGCGCCGAGGTGGGCTGTGCCACCTGCCTCCCT 
TAGTCCGGACTCTCCTCTCCACGAGGAGCCCGGACAGGTGCTTG 
TATCCAAAGGAGAGAGAAATCGGCGGGAGGGCTGGTGTGAACAC 
CCAGAGGAGGGAGCCGGAGTGGACGTCTGCCCCAGCGGCAACTG 
GACCCCTCTGGGGCACCAGGTGTCGGGACTCTCCTCCTGGGGAA 
ATCTCTGAGAG CCGAAGGAAGCGG C ATGTTCACAGGTGGGGGTG 
ACCGGATTCTCTGGTGGAAGTGTGGTGAAGCTCTTCCCATTCCC 
ATGACAGCTGGCGTTTGAGCACTCAGTGA [C/G] GGTGCTGCCA 
CACTCCCACACTCCTCCTAGGCGGCTATGCCAGGTGCAGACCtG 
CGAGTCCCTTCATCAGGAAGAGTGCTCTGTCTGCACCCCCAAAA 
CCTCTGCAAGCCAAAAGGAATCAGCTGCTGCCAGGGGTAAAACT 
CCCAGGCCTCATGTCCTGGTGGCTCCGGGAGTCAGGAGGAGCAA 
CCGTGAAGGGCTGGCTGCGAGCTGAGCTTACATCAAGGATTAAA 
AAGCATAATATCGTGGAGTCTCTTCTGCCTGGACGCTGTTCCTT 
CACCACCTGTCCCCAGCCGAGGCATGGCTGATCTCACCATCCGT 
GGGAGAGTCCTCAAATGGGTCCAGGTGAAGTTGGAACCAGTGT 


SEQID 
NO. 152 


KCP_3 85 8 
9 


TCAAACTTTTCATTTGCTCAAAGCCTACAGCAAACTCAGTCCAC 

ACACTTGGCTATACAAGAAAGGTTGCTTTCTTTGTTGTTCTATA 

ACTGACTTTAATTTCAACTTCAAGTC CC C ATTCTTGCCAAGGGG 

TAGAAATGGAATCTTGGTCAACTTAGGTTCCCCTCCCTACTCTC 

TGGGGTTGCATTTCCAGGCCAGGCAGTTTCTGCTGGTGCTTTTG 

TTCCTTGGTCCTCAGTCTTCTTTCTGTGTTGACATCCATTGACA 1 

TGTCCTCGACTCCCCTCATCTCAGATCACAGGCCCATGCTGACT 

CCAGGAGTATTCTTGTATTCTCTTCATCTGAACCTCAACACTTT 

TTGAGACCACGCATGCATGTGCTCTCTCTTTCTCTCTCTCTCTA 

ACACTTCTGGAACACTCTTGGACATGAGGAGATATTGGTCTTTC 


SEQID 
NO. 153 



iNSDOCID: <WO 2004041 193 A2_L> 



WO 2004/041193 



-134- 



PCT/US2003/034681 





TAGGATGGGGTCAACTGGCCCTGCCTCAGATCCATTGGCCTGTA 

CATATCTTGTAGCCATTGTGGTGCCATGGATCACAGGTCACGAT 

GCTGTGTGGCTGCCTCTGCTCTTAGACCTGCCCCCCATGCCACC 

AGAGGGAGTGTCTGCCTCCCCCTGCCCTGGACACTCAGCTGGAG 

GGGAGGGTCACAGTCCCTCACAGTCCCTTCTCCAGTGACAAGCA 

ACAAACTCCCAGTCTTCCTTTCTTTCTGATCCTCTCCTCCTCTT 

CCTCCTTCTCCTCTTCCTCCTCTCCCAGTCCAAGGAAGTTTTAT 

GCAAAGGCCAGAGGAGGGAATAATGAGGTGGAGGTCTCTCTGAC 

CAAGCATGTAGCCTTCCGGATCTGTTGTGCTTTCCAGGAGTCCT 

TCAAAGCTCTAAGCTTTTGGAATTCTGCAAGCTCAGGAAATTGA 

AAACCTTTTCTCTCACAACTGCAGGTCTTTGTCTGCAGTTGTAA 

AAGTCTGTTTAGAAACTCAGGAGACAAGCAGCATCTTCTTTGTT 

CCCTGCTTTCTGGAGGCAGTCAGCGTGGAACA [A/C] CCTGCCT 

GCAGTCTGACTCAGGGAAAGGGTCACTGAGTGTGTGTGTGTGTG 

TTGAGGGGTGGATAATAAGCAAGGAGAACACTCAGACAGAGAGC 

TCACAGAGGGGCACCCCAGCACCTCCCTCACCTCTATATTCCCC 

GCCTGGGCATAGTGGAGGGAGGGTTAATGCCAGCCAAGTTTAAC 

AGGCATTTCTGATTCGCGGCATTGTTGTTGCGCTATCCTGCAAT 

CCTACGCTGCGGGTACTGTTTTTATCCTGATCCTTCAGCTCTGG 

AAACTAATATAGAGAGCTGAGTAACTTGCTTGAGGCCATGATGC 

CAGGATCCACGGTGCCCCCAGGCTGAAGAGCCTTAACCACTGGG 

CTGTACCACCTCACAGGAGGGCAGGTGGCACAGTGCCTGGAACT 

TGGGAGGGTCCAGCACGTGGAACTATGCTCTGTCATTTACTTAC 

TGTGTGTCACTGGATCAGTCACTCAACACCGCTAAGCCTCATTT 

TCCACCTCTTCAAAAGGGATCTAATAAACCTGTTAGCAGAAGGC 

TGCTGTGAACACTAAATGAGGTGGCTTAGGTGAGAGCTCTGGTC 

TGAAGATGCTCACACTTTGAATCTCAAGACTTGTGTGAACCAAT 

ATCAGATTTCTCCTATTAGATTGCAATTCTCAGGGAGTCACATT 

CCGTCTCCAAATGCCCATCTCCTGATCCACAAAATGAGCACAAC 

ATCTCTGATAAACGGTAACTAGATGGTTCCAGTGGGCAGCGGGA ! 

GTGGGAGGGCGGTTGACTGGGCCAGAACCTCAAATGTATTCCTG 

TGTAGTTTCTCATGCATTCATTCAGTTTGGCACCAGAAGGTGCC 

CAGACTCACTTTGCAGCCAGTCTGTCGCCATAGAGGTGATAAAG 

GAAAAACATATGCACATTTAAACTTTTAAAAGTTTATTTGAACA 

TTCAGCGATTCACAAACGGTATAGCACAGACAGCAAGCAACTAG 

CACTCCTCTAGGAGGGGCCAAACAG I 




KCP_6519. 
9 


ACAGAAATCCTTAAGAGCATCAGCCGTGACACAGAAATCTAATA 
CAATAAAAC^AAGTGCTTATAAACCCCAGAGTTGTTTAAAACCC 
AGAAATTGC CAATTGACATATGGGACTATATCTTCTTAGC CC CT 
AGTAAACTGAGTGGCTTCAAACAAGTCCCTATCACCTCCCAGGG 
CCTCAGTTTCTTCACCTGTGAAATAAGAGGATCAAAAAAAGATA 
ATGTTCTCTCTGTTCTCTTCCAACCGAGGCAGGCATCTCAAGTA 
TTTCTTAGTCAGTTCTACTCTAGGCTACACAGTATCTGTATCTG 
GCAGCTGTATGAACTACTGTTGAAAATCCTCTTCCCAATCCCAG 
TTTCAACATCACTCCTCAAGGCAGCATCCACCTTCACTCTAGAC 
TGAATTAATTCCTCTGTCTTACCACCTAAACTCCTCTAGAAAAC 
TTGATAGAGGTAAAGATAAATGCATTTTTTCAAAAATTCTACTT 
TTCTAGTCCCAAGGCATTGTGTATATCATTCTTATGTAAGTTAT 
CACAATAAACCCATAATTAGTTACTTCCATTTATGTCAAATCGC 
CTACAAAGCAGAAAC ATGTATTATTC ATTTTTGGCTTCCTCC C C 
AGTATCTAGCATACGAACTGTTTGCAAACATGCCCAGTTCTTCA 
AACTTTGTAACTTCATGCCTTTTCTATCTACTACTTGGGATGGG 
CCCACCCTCCCTTTGTCCTCTAAGCACACTCCTATTCATCCTTC 
AAAGTCCAGCACAAAAATCCCCTCCTCTGTTAAACTTCAACTGC 
TCCAGGCTGAGTCTTATGTTTGGGTCCTTCATACGTACCCCTCT 
TCTATTGTTTGGGGTATTGTGTGCTGTGGGATCTGTTTACTCTC 
AGTTCTCCCCTCTAGGCTGGGTTCCTTGAAAAACACCCTCTGGA 


SEQID 
NO. 154 



WO 2004/041193 



-135- 



PCT/US2003/034681 





C7VTTTCACCTCTACATCCTCTGCATTCTTGGCCAGGCTCTGAGA 

GGGCATTGGTAAATGTTAACTGCCTGGCAATG [A/G] TGATGCT 

GTTAACCTGATGTGTCAGGGGTCTGAATAAAGCTGCCTCAAGGT 

AGG CAGATGC CCACAACCAAGCAAGAACTCAAAGCTGCAGGCTC 

CTCAGCCTGAACCTTAGACAGCGTCTTGGTCACCATTTCAACAC 

CTTGACCACATTTCTCACTCTCCCAAATTTCCTCCTGCTTATTC 

CTCATCCACATACATAAGGCTGTGTCTCCCAGGGGAAATTCAAC 

TACTTGGTAATTATCCTGCTTCTTAAGTTTGGGGCTAGGGGATT 

CATAGATGATGTTCAGTATTATGCTGTGCAATGTAGATGCTTCC 

TAAACCTTCTCAGGAGCTACCACTGAGTGGCACCTGGGGACCTC 

TCAGGAAGAGCCAGTTTTCTGGGCAGTGTGGGGCAGGACAGAGC 

TGATTAAACCAGCCTACCACCTGTCTTCCAGCTCCTCCTCTCAG 

CCTCTGGGCTTCCAGCAGAAAGCACACGAGAGCATTCTTGTTGG 

TTTTCTTATGACTTGAGCCAGCGAGACGTACATGCC CAG CACCT 

GTTACCTGGGCTGGCTCTTGGCTGAGAGCATACATGCATTGGGT 

CAGGTTTCAGATCTGCTGGAGGAACACAGCCAGAATGTCTTGAC 

AGGCAGCCCTGGCAAAGCCCCAGAAAATATAAGATCTGAGTCTT 

ATGATGGACTCTGTGACCTTGAGCCTCTCACCTCGTGACCTTGG 

GCATCTCATGTTCTCTCCACAGGTCTCGGTTCTGGACTCCTTCA 

TGGGAGCTGTCATGCCCCTGTCACACAGCAGTGTTGTGCCCCCG 

GGGATCAGGGACCAGGATGGTCCTTTCTTGGTGGTGAAGGGGGC 

ATTTTGCATATTC CAGAGATTCAAGTTTCCAGACCTATCTAGAA 

AGAAACATTTGAGTTTACAGGTTGGCGCTTCTCAGCCTCTGTCT 

CTCTTCCTCTCTGTTCATCTCCCTCTGTCCCCTCTATGTATGTT 

TGTGTCTCTTTCTGTCTCCTCTGCC 




KCP_8246 
8 


CTCACTGCCTGCAGTTTATTCAGGCATTGGATGAGACAGCTTCT 
TCCTGCTCCATGTGGAGTCAGCTGGGTACTTGAACTGGGACATG 
GATGATCTACTTTCAAGATGGCTTATTCTCAGGGCTG C C AAATG 
GATACCGGCTATCAGTTGAAAGCTATT^AGCAGGGGCACTCTGCA 
TAAGCATGGCTCATCTCTACAAAAGCTCCTCCCCAGTCTCCTTG 
TTTGGGCCTCACAGTGTATGGTAACCTCAGGGCAGTCAGAATGT 
GACAACTAAAGACTTCAGGAGTAAGTATTCCAGGAAGCAAGATA 
TAAGCTATGTGGCCTTCTAAGACCTAGCCTCAGAGGTCACATAG 
TGTAAC CTCTATC ACACCCTATTGGTAG ATATTGTAAC AGAAG C 
CCACCCAGTTTCACAGATGGGGACATAGACTCCATTTCTTAATA 
GGTAACTGGCCAGAGTTGTAAAAGAGCATGTGGGATGGAAGATA 
TTGTTGCAAGCATCTTTAGCAAATACAACTGGACATACCCAATG 
CAAGCACAGGATTGATCCTCCACTCTGCCCCCATACCCCATGAT 
TTATTAGCCACTCGGACAAGTGACTTCAACTCTCCAAGCCTCTG 
TCTCCTCCACTAAAGTGGGGACAAATGAGTATTACAAATGAGAC 
CATTAAATAAGATAATACATTTTAAAAATTAACCTGGTACCTGT 
CACAAAGTACATGCCTAACAAATGTTTGCTTCTGTCTCACTTCC 
TCAATTTCATCTCAGTCAACCTGGACTGACTCAAAATGGCATTC 
TTCTTGGCTGCCCCCTTTGAAGTATTTCTGCTGAGAAAATAGTT 
TCTGTGTATTTGTAAATTTACAGGTTGAACATAGATCATTATTC 
AAGCATTGCTGGTCGATTCGTCTTTTCAAAGGCGGGAGCTGCTG 
GCTGTGGGAAGGGACCCAGCAGGGGTCTCTTGCAACCCTGCTCT 
ATGGGTGGGGGAAATCTGGACCTCCCTCTGGT [A/G] GGGTTGA 
TTGAAGTGAAGGGTCACCATATGTCTTTCCCAAGAGGGTGACTG 
x ± v-v- ± vju i j. jl bb l L.UUAGTTTCCCTGAGATTTTCCTGAAAGC 
CCTTCCGGCTAGCCCAGTTGGGAGTGTTAGTACATCAGATCCCA 
TGCTTTGGTGAAAAATGTAAACACAGACCTGATTTTTCATTTTA 
AATGAAGCCAAGCATATTGCTCCCAGCAGATGCCGAGTGACTCA 
ATCTGTCCTCTCGGTTCTGAAGGGAACTGAAGAACAACATGGTA 
AAATAAAGCAAACAGCACATTTATTGGTTGATAAAATGCTGTTT 
TAGTCTACCCTGGCATTATATGGTGATTGCTATGTGGCGAACAT 
CTGTTATTAAATCCAGACTTCTGTTGCCTGGATACATTGAGTCA 


SEQID 
NO. 155 



NSDOCID: <WO 2004041 193A2_l_> 



WO 2004/041193 



136- 



PCT/US2003/034681 





AAAGCTGGAGCGGATGAGAAATCCATTTATGCGTCTGTTGCGTG 
TGAATGTCAGAGCTCATATGATGCCTTTGTCTTCATTCTAACTG 
AATCTTTTAATATGGACCGTCTCACTTGTTAATTCTGACTCAGG 
GGCAATAATGTTTTCATTTGATTAAAAAAGGTTAAAGAAACAAA 
GAAACAGTGTTTTCTCAGGTGCTCTAAGTAATTCTGTTAATGAA 
TTTTCGGAGACAGCGTGTGAATTTGAAAAGAGTAGGACTTTTTA 
AAGAGTTCATACTATGAACCCAATAATTCAGATCCTAGGGCCTT 
ATCCTAAGGACATAATAGAAATGAGCACATTTATAAGAACAAAG 
ATGTTCAATGAAGTGTTACTTACAACAGCAAAAAAACTTGAAAG 
TCACCTAAATGTTTGTAAGTCAAGAGCTTCATTGATATTGACTG 
CAAAGTCCATGTTATTCCATGTGACGAATTTTTTAATCAATCAC 
CTCTTGATGGATTTTAAATTTTTTACAATTTTTTGCTATCCTAA 
AAAAAATGTGTCAATGAACAACTTTGAACTACCCTGACTACCAC 
TTTAGGATAGATTGCTAGACGTGGA 




KCP_857 9 
3 


ATCACCCCAAATAGTTATGATGAAGGTGATCTATGTACGACACT 
TAGAGAATCAGTGATGGAAAATTCACCAAGAACAGCCACAGGCA 
GGCCAGAAGAATGGCCCTGCCCCTCTACTTTTAGGATTAAGCAG 
AAGCTGGCCCTAGATCTCACCAGTTACCAGTGATCTTGGGCATT 
TTTAGCATCATGTGCATTGCTTCACTGTGATACCATCTTGCTGG 
CACAGCCATGGAAAGCCATGAGTTAATGCATCTCCCCATGTAAC 
AAACCTCCCCTAGGACTCTGGTCCACACCTATCTCTGCTAGATT 
CTCTGGCATTGCAAGAAATTCTTCAGACTGCCCCAAGAGATTCG 
TTCCAATCTAGGGGCTCCTTATCCCCAGCTCAGAGCTGGATTTG 
GCTCTTGCTTGGAGGCGGGAAGCCCTGCTGGGCCAGGGCTTAGA 
GGGGCTCACAAGAAATCAAAGCAAGCATTCTCCGCCTCTCTCCT 
ACAGCCCTGCATGCATCTTCTCTGATCCCTTGCCTGAGTGGGGG 
GTGGCATTCCAAAAGCTCATTACTGGCTTACATACTTTGCCTTA 
AATCAGCTCTTAAATGCCCTGGGATGAACAGCCCTAAATAGGAA 
AGAAAAAAAAAAAACAAGTTTCTTGCAAGTTCACAGATATGCTT 
GGTGCTTTCTGTCAGGCTAGGGTGTAGCCTTCTCTGTTCTAAAT 
TTGATTTTCTGAGTCTTTAAGGAAAAATGGCTACTGGTCCCCTG 
GACGCTGATTGCTTCAGCATCTGAATCTGCTCCATCACTTCTAC 
CTCCACCCACTGGTCCACGTCCAGTGGGTAGAGGTAAAGGGGAT 
GGAGATATCATTTATCTTCAAAGGATAAAACTGCTCTGAGAGAT 
CTTTGCTTTCTTAGT^AACACTGCTGGAAAGTTGTTTCTTTAGAC 
TACATTAACAGAAGTACCATCTCTAGGAAGACAAGGTGGTAATA 
ACTAACATCAAATGAGCAGTTCCTATGTACCC [C/T] GTACATG 
TCTTAGCCAACTTCATCCTTGTAACAAACCTGGAAGGCAGGCAC 
TGTTATCACTCTTATTCCCAGGTGAACCAGTTGAGGTTCCAAGA 
AGTCTTTTGTGCAAGGTCATGCAGAGTTGAGG C CCCCAAGTCGG 
TAGACTTCAGGAGCCAGACCCTCAACCCCCTCACTGCCTCCCGC 
CTCATGCTGCACTGAGCAGACCATACCCGGATGGTCATGTTCAG 
GTTGGCTATCAATGCAGACCACGCTGGGCATATTCAGGGGACGG 
ATACTCAGAACTATATAACATAAGGAATAGAGGAAGGACTGGAG 
GATGTATTAACATGAAGAAAAGGTAGACTCATGGCAGGAGATGA 
GCAGGGTAAAGAGGTGCAAGACATAAAAAGCCAATTTCATATAC 
ATGAAGATTTATCAAG AGC CAGAAGGCC CTCT ATGGGTCC AAG A 
GTTACAAGGCCTAATGAGGTGAATTAATGCCAGCATATAAGGAA 
AAGCTTTTGAATACTCAGAAGTGTCCAAAAAGGGGTCAGGCTGC 
CTTGGAAAGTAGTAAGCTCTCCATCAGAGGCTTGGCAACTTCTT 
ATTAGGGATGGTATGAGTATCTCAAGTACAGATACAGATGACCC 
AAATAAC C ACTGAGGCACTTCTGACC C CAAGTATAAGAGATTCT 
ATTGTAACGCACAGGAGTCCATCTCAAGCAGCACACTGAGCCAT 
CTCCTTGATAAACCTAAAGGTAGGTATTATTCCTCCCAGATGCT 
GTCTTCTTAGCCTGGGATGCAAAAGCCATAGGATCACTTCACGT 
CCAACC C CC ATCAGGTGATCTGTCATGAATCACAAGTTATTGG A 
GCCAGATGGAACTACAGAGCTAAAAGATACATGAAGACACCGAG 


SEQID 
NO. 156 



JNSDOCID: <WO 2OO404U93A2_L> 



WO 2004/041193 



-137- 



PCT/US2003/034681 





GCCTGCAGACAGGGACTAACTTTCCAAGGTCACAGAGCTAACAA 
GTGTCAGAGTCAGGCTAGACCCAGGACTCACAAGTTGAGCTCAC 
AATTAGTTCCACTTCCTACACCACC 




KCP__93 54 
5 


CCTGAGCCTCTGCCTCCTTCTGAGAAAGACCCTTGTGATTACAT 
CAGGTTCACCTGGATAATTCAGGATAATCTCTTCATCTCAAAAT 
CCTTAAGTTGATCACATCTGCAAAATCTCTCTTACCATGTAAGG 
TAACATATTCACAGCTTCTGGGGATTAGGACATGCATCCCTAGG 
GAACCATGATTCAACCTAGCATGGGGGAACCCACTACAGGCAGG 
TGTTGTCCTTGCCATCGCCAGCTCAGTGCTTGGCACAGTAGAGG 
CCATGGATATTCATTCAGAGAGAGCATGCACTGAGGCAAGCCTG 
ACCTCAAGATCAAGACAGGAAATTGGCTTTCATGGGTTAAGGAC 
CTGTTACTTTGCTCATCAATGTATCCTTAATCATCAGAGGTCAG 
ATCTGCTGGAGAGTGCAATCTTTCAG [G/T] TTCCAAAAGTAAG 
ACTGGATGCCTTAGAACTTAAAGTCAGGGAGGTACCCAAGAAAG 
CAATCATAGACTGAGTCCCCATGCAGTGCACTTTCTCGGATGGA 
CAATTTCTCTGTTCTGACAGTCACTGTTGACTCCATTTCTCAGA 
TGAGGGACCGAGGCACAGAGAGGTGCAGTCAGTCACCTGAGGCC 
ACACAGTCAGGAAGTGGAAATCCATGGAAACTCATCATCAGCTG 
CCTCGCATCAGGGCCAGTGCTCTTTATCTCCACCCCACACATTA 
TAAAGCCACTCAGCTTTACACTCAAGGGAACTTCCTATTTCCCT 
ACTGGATTATATGTATAATTTGTAGTATTGCAAGATTTGAACAG 
AAGCGAGCAGCAGCTTGTAGTTGTGTGTGTCACTCACTCCTGCC 
TGTGGGGATGCCACGTGATTGTTTAAAGGGTTGGAATCAGGAGA 
AAGGCAGGCTCAGAGCAGGACCAAGAGAGAGCCCACCCCTCGCC 
TCCC 


SEQID 
NO. 157 


KCP__97 84 
4 


ATTATAAGTATATACCACACTTTGTTTATCCATTCACTTGTCGA 
TGGAAATTTGGGTTGCATC CACCTTTTTTTG CTATTGTGCATAA 
TGCTGCTATACACATGGCTGTGCAAATATCTAATATTAGTCCCT 
GCTTTCAGTTCTTTTGGATATGTATCCAGAAGCAGAATTCTTGG 
ATCATATGGTAATCCTATTTTTAATTCTTTTAGGAACTGCCATA 
TTGTTTTCCACAGCAGCTGCAGCATTTTACATTCCTACCAGCAG 
TGCACAAGAGTTCCAATTTCTCCATATCCTCACCAACACTTGTT 
ATTTTCTGTTGCTGCTGTTTGTTTTTTTATTAATA<3TCATCCTA 
ATGGGTGTGAAGTTGTTTCTCATTGTGGTTTGCTTTGCAGGTTT 
TGATTTGTAGATTTTCCTGATGATTAGTGATGGGTGCATCTTTT 
CATGTTCTTACTGACCTTTTATATATCTTTCTTGGAGAAATGTC 
TGTTAACTCTACTCATACTTTTGTAAATAGTATTCCCAATCCTT 
CTAACTCCCCAATGAGGTGGATATTAGTATGTTCGTGTTACAGT 
AAAGCCAACTAAACCTTAGAAAGACTAGGTT [A/T] ATTATCCA 
AGGTCACACAGCTAGAAAATGACACAGCTTGTATTGAAACATCA 
GTTTTTCTCTTTC CAAAC CTAACG CAC ATTTCATGAAAC CTAC A 
TTATTGCACCATAACATCATGTTGATTTACTTATCTGCTCTCCT 
GCCTGTCCCATCTACTACATAAATTGAGTGTGGTTTGAAATCAG 
AGACTACTTCTCATCTTTGGCACAGTGGCAGCCATGGATCAGAA 
TCTCTTACATGCTGGATAAGTGGATGCAAGCTCAAGGCCACACC 
TAAAGTC CCCAGGTGACTTGATCACTTGAGTTAGCTG CTGGAAA 
CCTGGGCTTCCTCTTCTGCAAAATGGGGAGAGAAAATAAATTCT 
CAGTGGATTGTTTAGAAGATTTGAGCAAAGACCTCTGCAAAGTG 
CTAAGCATGTGGCTAGCATGTGGCAGGTGCTGCCTAAATAGTAG 
AAATTAACACTGCCATGCTTATAAGCTCCGGACAAACACAAGAA 
GCCCGAAACATAATCTGTGCCTTCTGCTTGCATTCCTCCTAGTT 
GGGGATGTAAAATAGCCCAGCTACAATCAAAGAAGAAAATCAAA 
GTCAGCACAGACTATGGATATGCTTCTATATGTGTAGATTATTT 
CCAGACTCATTCGGAAGAATCTGGACATACTGGTTGCCTCAGAG 
GTCAAGAAAATTGGCTCATTTACTTCTGTAACTTAATTTCGACT 
CTCTATGCTTTTACATAGTTGGAATTTGCCATGCACATATACTA 
CATTTAAAAGAGCGTGTACGCG 


SEQID 
NO. 158 



NSDOCJD: <WO 2004041 193A2J_> 



WO 2004/041193 



-138- 



PCI7US2003/034681 



KCP_102 8 
82 


CA.CAATTATGCTGTAGGTGAGTTTTACCTTGGGAAACCAAGGCA 

CAGAATTTAAGTAACATATTGAAGCTCATGCAGCTGCTAACAGG 

GAAGGCCAGGGTCTGAACCCAGCTGATCCGGCTCCAGCATCCGA 

GCTCTGAACCACTGGTCTATCCTGCCTCTGTTAGGACTTGGTCC 

AATGTCATCATCCTAGAAGGAACATTTAGGCCCGCACGGTGGGT 

GGCTGGTTCAATCCAGTTTAAAGGCCAGGAGCAGGACAGTGACT 

TGCAGCTGCAGCAATCCTATGACTCAAACCAAAGCAGCTGTGAC 

AAATAAAGGGACTGACTCTCATTCTCCCGTGCTAGGGAAGGATG 

AGCTATCAGGCCTTGTTGCAGGCTGAGTCAGTCATCCCACAAAC 

CACCTAAGTGAAACCTCTTCACTGAGCCTTATTTCCTGAGCGCT 

CTCCCTTTATCTGTGCTTGCAAAGAGG [C/T] GTCTCCCTCCAT 

GCCAGCCAACCCACCCACCCCCGCACACACATACCACCTCTGGC 

TGGAACTGACGACCATGGGTTTTAGAAATGAGATAAATCTGGGA 

bAibAA J. oTATTCATGAGCCCATAAAGGGGTCATGAATCACTGG 

CCCCAATTACTGCCTTCAATCCTGACAGGATGAATTCCCTCAAG - 

CAGATTCTCCTTGTCAGACAACACGGGAGGCAGTGTCATGGCTG 

ATCTAGAGCCACAGATAACATCATTATTCCATACCAGGCTGGTT 

TCGGTTTCCCAAGCCACCTCCACTTGATTTACAGCTCACTTCTG 

ATGCTGGAGAGAGAGATAAATATATATATATATATATATATATA 

TATATATATATATATATATATGAAAGAAAGAAAGAAAAGAGAGA 

GAGAGAAAGACACAAAGGGGAAGCTTTCATGCC 


SEQID 
NO. 159 


KCP__10 73 
80 


ATCCCAATAGGACACATGTTGTATTAAAAAGCCATGCGAGACGG 

AAGAAGGAAATTGAATGAAATTTGAGGGCAGGTAGGAGCAGAGA 

CAATAAATAATTCAGCAGTGAAGGAAGCAGAAAAAAGATTGCAC 

TCATTTCGCCCTTCAACAATTATACTAAACACCTGCTCTGGGCC 

ACAGAAGGGCCAGATCCCATTCCTGTGCTCAGGAAGCCCACAGG 

CCGGCAGGGAGAGGCTGGTTGGAATGTGTGCTTTGCACTGTAAC 

GGAGGCATCGAGCATGGTAAGGGACTGGCGGTGGCTGCTGCCTG 

CGGACGTCGAGCAGGGGCCTTTGAAGAGGCAGGACCTGTCTGGA 

GTCTTACCTGGGCCTTGGCCCTGGCAATGGGGAATGGAGCAGGC 

AGCAGGGGACAGATGCTGCCAGA [A/G] ACCGAGATGGTGCCGG 

AGGACTGGGCTGAGTCTGGGTCAAATGACACCGCCCCAGGCTCT 

CTGCCCTCTGGGGTGAGGCAGGAGGCTGCCTCTGTGTGTGATTC 

AGAGACC CTAGAATCCC AGTGGCCATCACCCCACAG CACATGC C 

AACCTTTCTGTGATAACTTTCTCTTGTGGAACTGTGAAAGTGTA. 

AGACCAGCTCCTGTATAGTGCATGGCCATCCTTTGCTTTGGGGA 

CAGTAAGTCAGTCAACACATACTTATAAATGGGGTCCTGGGCCG 

TGGCACTGATCTGGTCCTCCCACCTTGCCTCACACTGCCCTTCC 

CACTCACCACTTCCCTCCTCTGCATCTTAGCCGCAAGGGACTTT 

CAGACCAAGCAGACCTGGAATCAAATCCCACCGCTGGGCCTCAA 

TGCCAGTGGAGACAGGAACAGCTGATCCCTGGAGCCCTCAGGAG 

GAAGAGGACGGGATGCCTGGC 


SEQID 
NO. 160 


KCP_10 8 7 
03 


CCCACTCACCACTTCCCTCCTCTGCATCTTAGCCGCAAGGGACT 
TTCAGACCAAGCAGACCTGGAATCAAATCCCACCGCTGGGCCTC 
AATG CCAGTGGAGACAGGAACAGCTGATCC CTGGAG C CCTCAGG 
AGGAAGAGGACGGGATGCCTGGCTTGGCTGCTGGTCTGGGGCAG 
GTGCCCAGTTACAGCAGTTGGAAAAATCCTCAGTGTTGGAAGGA 
AATTTGGAAGTGAGCATCTACCTGCCTGCCGTGCAGTTTGTGAC 
TTTTAAGATGGTTGACAGAACATTCCCAAAGGACCACAGCGGTG 
ACCACTGTTCTCGTTTCCCTTTGGTGGCTCACTCACTCAf5Tnr , T 

GGACACAGTGGTCCTGACAAGACAGTGCTGTGGCTTCCATGAAC 
CTAGGACAGGGATAGACTCAAGGACTAAGAACAAACCAGGAAGA 
AGCATCACCACAGGCTCCTTGCCAGTCACCTCATCTCACCCTCC 
TGGCCCTGGCGGATGGGTCTCCATATTTACAGGGGCGAGATGAA 
AAAACCAGAGGAGCCAGGAAAAGGAGCTTCCCCTTCCCAAGGGC 
GCAAGGTGAGGTGCCAGTCATGAGATGCAAGCCCTGAGCTTTCT 
GATTCCACTGCATGTGGTCCCAAGGTTCGGCGCCGCATCACACA 


SEQID 
NO. 161 



WO 2004/041193 



-139- 



PCT/US2003/034681 





GTTAGTGAGCACACTCTCCTCCCCTGGCCCCGAGTGAGCCAGCT 

GGATGG CAG ATCAGAAAGAGAAGTC CCGGGTGCCCCCAACATGG 

CTAGCTCCTTCCAGGACCAGGGGCTAGGCCCCAGCTAAGGCTGG 

TGCACACAGCAGGGCAGGGGGCGAAGGAGTGGGATCCCACCCAG 

GGATCCCACCCACCCCAAACCTGCTTTCGGACATCTTTCCAATG 

CATAATGTGCAGATGAGGCCCTTTGATAAGGACCAAATCCCTTT 

CCGTTGCTTGGCAACCTGGCTCACAAGTCATAGCAGGGAAGTAA 

TTTACAGGAATTCAAAGTGTCGCTGGAGGTTC [G/T] GCTGAGC 

TGAATTGCTGCAAAGAGGAACCTCAATGGTCCAAATCACACCTC 

TGGCGGGGAGGAGGGGCTGAAGGAAAAGCTTCCACTTCCGTCAC 

TTGAGAGTACAGAGCCCTGAGCTCAGACTCAGCGATCGTTTTCC 

ATTAACGGATTTACTGGTTCCATGTTGAGCTC CTGCTGTGTGG C 

AGGCCCTGTGCTGGGAGCCAGGGACACAGTGACAAACGAGACAG 

ATGCCAACC CCGGATG CACAGAGCTCAAAGAG ACAGAGGAGTAA 

ACAGGGCTACACATGTGACAAGATAGGCTGTGCACAGGGGTCTG 

AGCAGGACCCTTGGGGCAGGAGGAGGCAGTGGAGGGATGGGAGG 

GTAGGGACGCAGTGGTGACCAGCTAGCCAGATAGAGAACAGAGG 

GTGTCCCAGCACAGGGCCACACAAGCAAAGGCAGAGGTGGGGAG 

AGAAGAGCCTGCCACACTCTCAGATCACCATGTGGTTGGGCCAG 

GGCCCCAGCTGAGGCTGAGGACACATGGAGCCCAGATCCGGCAG 

GGCCTTGAATGCCAAGTCAGAAAGCATCTGAAATTTAGTCTACA 

GATGATGTGGGTTATTGACAGCCAGGACAGGGAATGACATTTGT 

GTTTCAGGAAAACCACTGTCTTCACTGTTAGGGGGTAGATTCAG 

GGAGAAACAGGAGGTGGAGGGGAAGAAACTGTGAGTAAAGGAGT 

CTCTGGGGTACAGGTGAAGTTTCTGTGAAACTGGAGAAGAAAAC 

TGTTGAGGCAAGAGTTGACAAAACTTGAAGTAGGATGGAGAGGA 

AGGGACAAGTTCCCCTGGCATGGTGACGGCCCGGTGGTGGGAAC 

CAGGGAAGAGGAGGGGCTTTGCAGGTGTCTGACTTGCCCAACAG 

GTGGCGCCATTTACCAAGATGGGAAGGGCCGGGGAGAAGGGAGG 

GTTCCATTCTAGGGAAATCTCAGGTCCTCGCTATTAGGATTCTT 

TCGGTTGCCAGTGACTGAAACCCAG 




KCP 1248 
77 


ACTGTCTAGATCTGGGGACCCTCCCAAGCTCTCAGAGCTTTGGA 
AGGAAGGTCCCTGCAGGGAAACTGTGTGTGTTTCTTCAACAGTG 
TATCCTCAGTGCCTAGCACATGGTAAGTGTTCCATAAACAGCTG 
TTGAAGAGACGGATGGATAACTGAATGAATGGATGCTTCCATGG 
GCAATGACACACTAATCTGAAAAGCCCTGTATCAATGAAAGAAT 
CACTTAATAGTTTAACTTTTCCCTCATCCTTCAGAACACAGATG 
GCATGCCATCTTCCCTTCAAATCTCTTCCCAGTGCCCCACACAG 
AAGAGGCACACTTGGACACTGGTGTCTGATGGACCCAAGTTCAC 
AGCCTGTCTCTGGTCATCAGGTATCATGACCTTGGGCAAGAAGC 
TTAACTCTCTGAGCCTCAGTTTCCCCTTCTGTGCCCCAGGGAAA 
ATG AGTC CTGCC CCTCCTAAGGGAGGTATGAGATGTAAGACCCC 
GAAGGACACAAAGGTT [C/T] GCCAGGAGCCTTCAGGTAGGAGG 
CAGGTAAGGAGGTCTGCTAGATTGGAATGAGTTTCTGGAAGGCC 
CCAAGGAGCTCAAAATCAGACCTGGGGTGAAGGTGTCTTGACCA 
AAATGAGACCCATCAAAGAAGCCTGGATGAAGGTGCCCACAGCA 
TCCATGAGTGCCAAAAACAGAAACACTTTAGCCCAGGATACAAG 
GAACATTTTAAAGCAACAGAGATAAGAGATAGTTAGAACTCAGG 
CCTCCTGGCTCTTGCTGTTCTTGGCCCATAATTAGTTGTTATGG 
bALLi 1L1 I CTTGGTACCTTTGCCAAACAA 
TCTGATGAGGAGAATATTGAGTCATGGTGCCAGGGAAAATTAGC 
ATATTCTGCAAATTCCTGGCACTGTTAACACTGGATTCTGTCCA 
CCTTTAG AAATCCTCAGATCACTATGTCAG CATCCCC CAATCAC 
AGCTCTCCAACTTCAAGGAGGGTTGAGGGGTCTGAAG 


SEQID 
NO. 162 


KCP 12 60 
86 


AAGAATATCAGTTCCACTTCCCTTGTCCCTAGAGAGCCTTGTAG 
TGGATGTTGATGTGTCTTCCAACACATGCACCAACCTTTCCCTG 
TCCTGTAGCAGTTGAGATGGAATCATCCCACTCCCAGCTCCAGG 


SEQID 
NO. 163 



INSDOCID: <WO 2004041 193A2J_> 



WO 2004/041193 



-140- 



PCTYUS2003/034681 





AATAGGCTCTGATGGGCTTGAACCCAGCAGCTTAATTCCATTGG 
TTCTCTAGGCCTTCATCATTAGTACAGGAAAGGCACTTGACCTA 
AATTAGTTCGATAAGATTTAAGCTCAGAAATCTGGTTTGTTGGA 
TGGAGAAAGAGATGCTTTCTTTCTCTCTGGAAGGAGTTTATTGC 
AAAAGTAAGGGCTGGGGCTGCTACAGCCATTGTGCTACCATGAG 
GGAACTAGCCATGATAACAAAACTTGCCTGGGGAGGGGCTACGC 
ATCAC AGAAAATG ATGC CAAAGTCCTGCTCAAACTGTGC CTGAT 
GCCTGCCTGATCTATGGACTTCTTAGTTCCATGTAATGGATTCT 
CTCTATTTTTAAAGCC [A/G] TATC AGG TTGAATTTTTGG AG AA 
ATAAAACAAAAAGCATCTTGACTAATTTAAAAAATCTTCTTTGG 
GTATTCAACCCTCCTAAACTCACCCCCAAATCCACTGGGAGCAT 
GTCAAGATTTTTGTGAGCCGATTTAGGAGATGCAAATTCATTTG 
CCTTAATTGGATCTCCAGGAAATGACTTCTGCCCCCTCTTAAAT 
CATTTAAAGCTCAAAGAGGCATGAGGGCCCTCCCCAAGGATGCA 
GGTATCCTCTTGACTGACAGCCTGTATGCTCTGCTTCCAGGATC 
CTTCCATCTCCTCCCTTTACTGAGGGAGTCTGCTATGTGTTAGA 
GGTGTCCATCACTGGTCACACTGGGAAGCTGTGGCAGGGAAGCT 
GGAGAAAAAGCAAGATAGGCCCCAGAAAGAACACCAACTCCAGA 
CTCAGGGAGACTCAGGCCAGAATCCTAGCTCAACTTCTTCCAAG 
CTCCCAAAGTCACACTCTTTTCTCTGAGCCTCGATTT 




KCPJL2 62 
08 


ATCCCACTCCCAGCTCCAGGAATAGGCTCTGATGGGCTTGAACC 
CAGCAGCTTAATTCCATTGGTTCTCTAGGCCTTCATCATTAGTA 
CAGGAAAGGCACTTGACCTAAATTAGTTCGATAAGATTTAAGCT 
CAGAAATCTGGTTTGTTGGATGGAGAAAGAGATGCTTTCTTTCT 
CTCTGGAAGGAGTTTATTGCAAAAGTAAGGGCTGGGGCTGCTAC 
AGCCATTGTGCTACCATGAGGGAACTAGCCATGATAACAAAACT 
TGCCTGGGGAGGGGCTACGCATCACAGAAAATGATGCCAAAGTC 
CTGCTCAAACTGTGCCTGATGCCTGCCTGATCTATGGACTTGTT 
AGTTCCATGTAATGGATTCTCTCTATTTTTAAAGCCGTATCAGG 
TTGAATTTTTGGAGAAATAAAACAAAAAGCATCTTGACTAATTT 
AAAAAATCTTCTTTGGGTATTCAACCCTCCTAAACTCACCCCCA 
AATC CACTGGG AG C ATGTCAAGATTT [T/C] TGTGAGCCGATTT 
AGGAGATGCAAATTCATTTGCCTTAATTGGATCTCCAGGAAATG 
ACTTCTGCCCCCTCTTAAATCATTTAAAGCTCAAAGAGGCATGA 
GGGCeCTCCCCAAGGATGCAGGTATCCTCTTGACTGACAGCCTG 

GGAGTCTG CTATGTGTTAGAGGTGTC CATCACTGGTCACACTGG 
GAAGCTGTGGCAGGGAAGCTGGAGAAAAAGCAAGATAGGCCCCA . 
GAAAG AACAC CAACTCC AG ACTCAGGGAG ACTCAGG CCAGAATC 
CTAGCTCAACTTCTTCCAAGCTCCCAAAGTCACACTCTTTTCTC 
TGAGCCTCGATTTTCCCATCTGCAAAATGGGGATACTAAGGGTC 
AC CTAGCTGGGCTGCC CTGGAGATTC CAAGACATTA 


SEQID 
NO. 164 


KCPJL2 90 


GGGTCCTAACAGGCCACAGACCCATCCGTGGCCCAGGGGATTGG 


SEQID 
NO. 165 


93 


GCTCTTGTTGCCCAGGCTGGAGTGCAATGGCACGATCTCGACTC 
TTCAACCTCCGCCTCCTGGGTTCAAGCCATTCTCCTCCCTCAGC 
CT CC CAAGTAGCTGGGATTAC AGGCACC.CGCCAC CATACCTGGC 
TAATTTTTGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGT 
CAGGCTGGTCTTGAACTCCCGACCTCAAGTGATCCGCCCACCTC 
AGCL ILLLAAAb rGCTGGGATTACAGGCGTGAGCCACCACGAC C 
TGCCCGGGGACCCCTGTCTTAAACCACCCCAGCCTGTGATACTT 
TGTTATGGTGACCCTAAGAGGCAAATACACCCTCCTTTCCCCAA 
CCTCTCCCCTCAGACGAAACCGATGCGAAAAGTGCTTCATGAAG 
TTTCAGGTAAAGAAGT [C/G] TGGGACGAAAAGGGATAGTGAGG 
ATGGCGGGAGGGGCTGAACTCCAAATGGGCTTATCAAGGCTCTG 
CAAAATGGCGTGACGGCGCTGCCCCCTTCTGGTGGCCTGAAGAC 
TAACGCACATGATGTCAAGTGCGGGGCCCAAGTACTCAGGAAAA 



JNSDOCI D: <WO 2004041 1 93A2_l_> 



WO 2004/041193 



-141- 



PCT/US2003/034681 





GGTTCTCATTTGGACACTGGGAGGTCTTACATTGGGGGCCCTGA 
GCCTCCAGCCCTTCCAAATCTATTCTCAGCAGGAGCTCAGCCAC 
ACCTGTGTCCCAGAACTGAGGCCAGGCCCAGCCTTCACTCCACG 
CCCAGCCAGCCCCAAGGAACCGACTCCCTGAGGCTCTATGCTCC 
CTGCCTCCAGTGGCCCCGTGTCTGGGAAATAGTGGCCCTGGCCT 
GATGGCCTGACCTGGGCAATCCATCCCCTGGTCCTCTCAGCTCC 
CGGGCCCAGGTTTTCTGGGCTACTTTAACCAGGGCAAACTCATT 
CCTCGAGTACAAAATAAAAGATTCGAACAGCATAATC 




KCP_12 91 
2 7 


GGTTAGTGGGATGCAGCGCGAGGCTAAGGAGTGTCTGGGGCCAC 

CAGAAGCCAGGGAAGCCTAGGAAGGGTTTTCCTAGAGCCTTTGG 

AGGGAGCACAGCCCTGCTGACACCCTGACTTCAGACTCCCAGCC 

TCCAGAGCTGGGAAGGG ATAAGTAGCTGTTGCTTTAAAC CAGTG 

GTCCCCAACCCTTTTGGCACCAGAAACCGGTTTTGGTTCAGTGG 

AAGACAATTTTTCCACGGACAGGGTGTGTGGGGTGGGAGATGGT 

TTCAGGATGAAACTGTTCCGCCTCTGATCATCAGGCATTAGCAT 

TAGTTAGATTCTCATAAGGAGTGAGCAACCTAGATCCTTCGCAT 

GCGCAGTTCGCAATAGGGTTCATGCTCCTATGAGAACCTAATGC 

GGCGGCTGATCTGACAGGAGCGGAGCTCAGGCGGTAATGCTTGC 

TCGCCAGCTCACCTGCTGTGCAGCCGGGGTCCTAACAGGCCACA 

GACCCATCCGTGGCCCAGGGGATTGGCGACCCCTGTCTTTTTTT 

TTTTCTTTTTTTTGAGATGGAGTTTCGCTCTTGTTGCCCAGGCT 

GGAGTGCAATGGCACGATCTCGACTCTTCAACCTCCGCCTCCTG 

GGTTCAAGCCATTCTCCTCCCTCAGCCTCCCAAGTAGCTGGGAT . 

TACAGGGACCCGCCACCATACCTGGCTAATTTTTGTATTTTTAG 

TAGAGATGGGGTTTCTCCATGTTGGTCAGGCTGGTCTTGAACTC 

CCGACCTCAAGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGG 

GATTACAGGCGTGAGCCACCACGACCTGCCCGGGGACCCCTGTC 

TTAAACCACCCCAGCCTGTGATACTTTGTTATGGTGACCCTAAG 

AGGCAAATACACCCTCCTTTCCCCAACCTCTCCCCTCAGACGAA 

ACCGATGCGAAAAGTGCTTCATGAAGTTTCAGGTAAAGAAGTCT 

GGGACGAAAAGGGATAGTGAGGATGGCGGGAG [A/G] GGCTGAA 

CTCCAAATGGGCTTATCAAGGCTCTGCAAAATGGCGTGACGGCG 

CTGCCCCCTTCTGGTGGCCTGAAGACTAACGCACATGATGTCAA 

GTGCGGGGCCCAAGTACTCAGGAAAAGGTTCTCATTTGGACACT 

GGGAGGTCTTACATTGGGGGCCCTGAGCCTCCAGCCCTTCCAAA 

TCTATTCTCAGCAGGAGCTCAGCCACACCTGTGTCCCAGAACTG 

AGGCCAGGCCCAGCCTTCACTCCACGCCCAGCCAGCCCCAAGGA 

ACCGACTCCCTGAGGCTCTATGCTCCCTGCCTCCAGTGGCCCCG 

TGTCTGGGAAATAGTGGCCCTGGCCTGATGCCCTGACCTGGGCA 

ATCCATCCCCTGGTCCTCTCAGCTCCCGGGCCCAGGTTTTCTGG 

GCTACTTTAACCAGGGCAAACTCATTCCTCGAGTACAAAATAAA 

AGATTCGAACAGCATAATCAAATAGGTCATACCCATAAATCAAC 

ACATTTGAGCACCTAf TTTGTTGTTCTTTCACTAATCCAAACCA 

TATTTATTGAGCATCTACTATGTGCCATTCTCCAGTAGCCATTC 

TAGGTGCAGGGGATACAGCAGAGACCTTGAAAAAAGGAACAGTC 

TCTGATCTTGCTGAGCTTAGAGTCAAGTGGAGGTGAGGAGGAAG 

GAAATGAATTAACAAC TAAGTGAAGC AGAAGGTAAC C AATTG AT 

TGACTGACGAAGGGGTACAAACAACAAACACCTTCCTTTCTCCA 

AACTCTATCTTTAACTGTATTCTCTCGTTTTCCTTCCTCTCCAT 

TTTACAATCATTTTACAACATCTCTGGCTATTCTCCTATATTTC 

TGATCACTTCGGTTCTCATCACAATAATAATTTCAGTTTTCAAG 

CATTGGAAAGTCCCATCCAATTAAAATGTCAATCTCACACGCAG 

TTTAAACGTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGTGC 

CTCAGTTCTTGTGTGGAGGGGAGGA 


SEQED 
NO. 166 


KCP 12 96 
90 


TGGTGGCCTGAAGACTAACGCACATGATGTCAAGTGCGGGGCCC 
AAGTACTCAGGAAAAGGTTCTCATTTGGACACTGGGAGGTCTTA 
CATTGGGGGCCCTGAGCCTCCAGCCCTTCCAAATCTATTCTCAG 


SEQED 
NO. 167 



INSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-142- 



PCT/US2003/034681 





CAGGAGCTCAGCCACACCTGTGTCCCAGAACTGAGGCCAGGCCC 
AGCCTTCACTCCACGCCCAGCCAGCCCCAAGGAACCGACTCCCT 
GAGGCTCTATGCTCCCTGCCTCCAGTGGCCCCGTGTCTGGGAAA 
TAGTGGCCCTGGCCTGATGCCCTGACCTGGGCAATCCATCCCCT 
GGTCCTCTCAGCTCCCGGGCCCAGGTTTTCTGGGCTACTTTAAC 
CAGGGCAAACTCATTCCTCGAGTACAAAATAAAAGATTCGAACA 
GCATAATCAAATAGGTCATACCCATAAATCAACACATTTGAGCA 
CCTATTTTGTTGTTCTTTCACTAATCCAAACCATATTTATTGAG 
CATCTACTATGTGCCA [G/T] TCTCCAGTAGCCATTCTAGGTGC 
AGGGGATACAGCAGAGACCTTGAAAAAAGGAACAGTCTCTGATC 
TTGCTGAGCTTAGAGTCAAGTGGAGGTGAGGAGGAAGGAAATGA 
ATTAACAACTAAGTGAAGCAGAAGGT7^ACCAATTGATTGACTGA 
CG AAGGGGTACAAACAACAAACACCTTC CTTTCTCCAAACTCTA 
TCTTTAACTGTATTCTCTCGTTTTCCTTCCTCTCCATTTTACAA 
TCATTTTACAACATCTCTGGCTATTCTCCTATATTTCTGATCAC 
TTCGGTTCTCATCACAATAATAATTTCAGTTTTCAAGCATTGGA 
AAGTCCCATCCAATTAAAATGTCAATCTCACACGCAGTTTAAAC 
GTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGTGCCTCAGTT 
CTTGTGTGGAGGGGAGGAGAGGAGAGGGGAGGGGAGGAGAGGAA 
AGGAGACCGGGGAGGTGGGGGGGGAGAGGGGAGGGGA 




KCP_13 03 
09 


CTATCTTTAACTGTATTCTCTCGTTTTCCTTCCTCTCCATTTTA 

CAATCATTTTACAACATCTCTGGCTATTCTCCTATATTTCTGAT 

CACTTCGGTTCTCATCACAATAATAATTTCAGTTTTCAAGCATT 

GGAAAGTCCCATCCAATTAAAATGTCAATCTCACACGCAGTTTA 

AACGTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGTGCCTCA 

GTTCTTGTGTGGAGGGGAGGAGAGGAGAGGGGAGGGGAGGAGAG 

GAAAGGAGACCGGGGAGGTGGGGGGGGAGAGGGGAGGGGAGGAG 

AGGGGAGGGGAGTGGGGGAGAAGGGGAGAAAAGCGCAGCTGGCT 

TCCTCACTCTCCTTTCCTTCCTGACCATCCTTACCCTGGCCCAG 

GGCAGGAGGAGGATTGGCAGAGTAGA [ A/ G] GCAGGGTCTTCTG ; 

TCTTAGCTGGGCCTGTTGGTGACTTTCTGTTGGCCAACATGGGC 

TGACTGGAATGTTCTCGAGCATGGCACATGGTCATCCAGATGCA 

GGCTCTTCCCTGGGGCACTATAGCAGAGAGGGCTCTCTTCCAGT 

CTATTGCAGATGGATGCCCTCGTGAGCTGAGTTTTGATGAAGAT 

CCCATGTCCCCAGCCACCCCATTCAGAGCCTCTTTCTACTCTGG 

TCCTCTGGTCCCAGCAGCAGCCCTCTGGGTACTGAGGGGAGGGC 

ATCTCACCCAAGCCCCTTAAACCTGCTCACCTTCTTCAGAGCCC 

ACGTGGCCGCAGGAAAGTCACAAACCCTTGTGCTCCCACAGGGC 

ACACGTGTGCACACGTGTGCAGCTACCTTCTCTCTAGTTGGTAC 

CTGAGGCTGCCTCCTGGATTTTCCAGTCTCTGTGTTCCCAGACA 

ACCCCAAGCCCCAAGAATACAA . 


SEQID 
NO. 168 


KCP_13 05 
57 


AGTTTAAACGTTTCGCCTGCCCGTGAGCTCAGACCTGTCTTGGT 
GCCTCAGTTCTTGTGTGGAGGGGAGGAGAGGAGAGGGGAGGGGA 
GGAGAGGAAAGGAGACCGGGGAGGTGGGGGGGGAGAGGGGAGGG 
GAGGAGAGGGGAGGGGAGTGGGGGAGAAGGGGAGAAAAGCGCAG 
CTGGCTTCCTCACTCTCCTTTCCTTCCTCACCATCCTTACCCTG 
GCCCAGGGCAGGAGGAGGATTGGCAGAGTAGAGGCAGGGTCTTC 
TGTCTTAGCTGGGCCTGTTGGTGACTTTCTGTTGGCCAACATGG 
GCTGACTGGAATGTTCTCCAGCATGGCACATGGTCATCCAGATG 
CAGGCTCTTCCCTGGGGCACTATAGCAGAGAGGGCTCTCTTCCA 
GTCTATTGCAGATGGATGCCCTCGTGAGCTGAGTTTTGATGAAC 
ATCCCATGTCCCCAGCCACCCCATTCAGAGCCTCTTTCTACTCT 
GGTCCTCTGGTCCCAG [C/G] AGCAGCCCTCTGGGTACTGAGGG 
GAGGGCATCTCACCCAAGCCCCTTAAACCTGCTCACCTTCTTCA 
GAGCCCACGTGGCCGCAGGAAAGTCACAAACCCTTGTGCTCCCA 
CAGGGCACACGTGTGCACACGTGTGCAGCTACCTTCTCTCTAGT 
TGGTACCTGAGGCTGCCTCCTGGATTTTCCAGTCTCTGTGTTCC 


SEQID 
NO. 169 



JNSDOCID: <WO 2004041 1 93A2J_> 



WO 2004/041193 



-143- 



PCT/US2003/034681 





CAGACAACCCCAAGCCCCAAGAA.TACAAGAGCTCTGTCACCAAG 
CATCGGGCCTGTGGCTGCACTACACGTCTGCAGCTCAGGACCCC 
TGGCTGCGGCGTAAGCTACCAGCATCCCCTTCTCATGGGCACCC 
TCATCTCCGGCTCCCCATCGCTGGGCTGTGACCTGCGGGGGCGC 
CCCTCTATGGAAGGGAAGGAGAAAAATTCACAGTGCTATCTACT 
CCTCTGAATGCACTCCCACCAATTTCCTTGGAAATTTCTAGCTT 
TCACTGACATATCTGGGATGGGGCGGTGGTCACAAAA 




KCP_1312 
44 


GTCTCTGTGTTCCCAGACAACCCCAAGCCCCAAGAATACAAGAG 

CTCTGTCACCAAGCATCGGGCCTGTGGCTGCACTACACGTCTGC 

AGCTCAGGACCCCTGGCTGCGGCGTAAGCTACGAGCATCCCCTT 

CTCATGGGCACCCTCATCTCCGGCTCCCCATCGCTGGGCTGTGA 

CCTGCGGGGGCGCCCCTCTATGGAAGGGAAGGAGAAAAATTCAC 

AGTGCTATCTACTCCTCTGAATGCACTCCCACCAATTTCCTTGG 

AAATTTCTAGCTTTCACTGACATATCTGGGATGGGGCGGTGGTC 

AGAAAATCAATCCCACTTTCCCTCGGCTAGTCTTACAAGCACCC 

AAC AGCTCTATTCAGAATACAGGGCTGC CCAGCTACTTC CCATT 

CATTATCCCCAGGTTGCAAGCTTTAGTCAAAACCCAGAGGCAGC 

AGGGTGTCTGGTTCCACCTGCTGTTAGGATGATTTCAGGAGTGC 

AAAGTGTTAGAAACGC [A/G] GTAAAACATGATGCTTAGAGATT 

AAGTGGGATGGGGACTGGGCAGATGATGCTGCTTTGGACCCAGC 

GAGTGAGGTGAGACTGCGACAAGACAGAGCCACTGAGCAGTGAC 

CTGGGGGATGGGCATTGCAGGCAAGGCAGAACCCCAAGTGGGAA 

CAACCTCACTGGGCTTAGCAAAACTAAAGAGGCCCAAAGTATAC 

TGAGCGATGAGGTGAGTGGCGTGGGATAAGGTTGGAGAGGAGGC 

TGGAACCAGACCCTGCAGGGCCTTGCAGGTGATGGGAAGGAGTT 

TGGAAGGTGCTGGAAGGTTTGAAGCAGAGGAGGGATATGATCAT 

GCCTGTAGCTGCTATGTAGAACAACTGTATGCATGCCAGGCCTG 

TGCCACGCATGCTCTAATCATTACTGGCTTTAACCCTTGCACTA 

ACGTTGTCATGCAGGTAGGAGCATCTGCACCCAGCAAATGGAAA 

CTGAAGCTCAGGAATATTCAGTCACTTGTCCAAGGCT 


SEQID 
NO. 170 


KCPJL318 
54 ' 


ACCTGGGGGA.TGGGCATTGCAGGCAAGGCAGAACCCCAAGTGGG 
AACAACCTC ACTGGGCTTAG CAAAACTAAAGAGGCCCAAAGTAT 
ACTGAGCGATGAGGTGAGTGGCGTGGGATAAGGTTGGAGAGGAG 
GCTGGAACCAGACCCTGCAGGGCCTTGCAGGTGATGGGAAGGAG 
TTTGGAAGGTGCTGGAAGGTTTGAAGCAGAGGAGGGATATGATC 
ATGCCTGTAGCTGCTATGTAGAACAACTGTATGCATGCCAGGCC 
TGTGCCACGCATGCTCTAATCATTACTGGCTTTAACCCTTGCAC 
TAACGTTGTCATGCAGGTAGGAGCATCTGCACCCAGCAAATGGA 
AACTGAAGCTCAGGAATATTCAGTCACTTGTCCAAGGCTCCCCA 
GCTGTTAGGTGCTAAGGCTGGATTCAATCCAGGACTTGCAGACT 
CCAGTATCTTGGCTTTTCTAACGAGAGTGTGCTAGCTTTCTAAT 
GGGGGTGGGGAAGGCA [G/T] TCTGCCCCCCTCCCATGGCAC CG 
TGAGCAGGTGTCACTGCTCCAGCCAGTACGCCTGGACACCGACT 
AGGAAGGAGTATGTGCTACTAGGAGGGATGGTCTGGGCTGACTC 
TTTGAAGTTGACAAGGAGTTGCATAATCCCAGCTAATAATTATG 
CTGGAC CAGGGGCAGAGACATTACTCCAAGGGTGACCAGG TGTG 
GAGAAGAGGCTGCTGACTCCGGGGCCCCAGGACCTGGCCCCCAG 
GTCTCATTGCCCGAGTGCTGCCCCAGAAGGAGTAGAAGCTGGAG 
CTGTCCGGGCCACAGCCGAGGCTGGGTGAATGCTGCAGTGAGGC 
TGCCGCACAAGTTGCGTGTTGTGACATTTGTCTTCTGGAGGGGA 
TTGGGATGGGCTACTTCAGCATTTAAAAACCCCTACTAGGTCTG 
AGAAATCCCCTCAGCTTATGAGCCTGGGTGGGCAGCAGGCCTTC 
TCAAGAAGCCCAGAAGGCCAGATGCTCACTTCCCAGG 


SEQID 
NO. 171 


KCP_132 6 
77 


CAGTGAGGCTGCCGCACAAGTTGCGTGTTGTGACATTTGTCTTC 
TGG AGGGGATTGGGATGGGCT ACTTCAGC ATTTAAAAACC C CTA 
CTAGGTCTGAGAAATCCCCTCAGCTTATGAGCCTGGGTGGGCAG 
CAGGCCTTCTCAAGAAGCCCAGAAGGCCAGATGCTCACTTCCCA 


SEQID 
NO. 172 



3NSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-144- 



PCT/US2003/034681 





GGCTCTCTTGCGGCTGAGCTGAGAGCAGGCACCTGAGGCCTGGC 
AAGTGTGACAGCTGGTGACACAGACAGACAGGGACAGGGAGATG 
GGACTGTGCCTGCAGCGGTAGCCCTGGCCGGTGTTCAGTGGGGC 
CAGCATCCGTGTCTTTCCTGGGGGCCAGTGGGGGCCGTGGCTCT 
GACGATGCATCCCTCCCCCACGTTTTTTCTCTTCTTGTCTTGGA 
CTTTGCAGGGAGCACTCTGCTTTTGGGAACAGGAGCTGGGTCTC 
TGGCCATTCTCCGCAGCCCCTCACCATTCACTCAGTGGCTCTCA 
AAAAATAGAACCTGGG [A/G] CAAAGCTGTTCTTGGCCCCAAAC 
AACATGAGGAAAAATAAATAAATAATGTACCTGGTAACTGAGAG 
AGTTCCCTCTGCATCTTGGGCTCTTTCAATGAGATGTCCTCTGC 
CTGCAGCAAGCCCCAAGGGCTTCCCTCACCAGGACCAGCACCCT 
GGTTTGCCTGACCCCACACCTGCCAATGCCGGGGCAAGAATGTC 
CCAGGCTGCCCTGGTTCCCAGAGCTGATGCTTCCCACAGTGCCC 
AGCTGTGCTGGC ATGGAGCTAAGGACAGGGCCAGTC C CAAGAAA 
ACAACAAGGCTCCAGGGCCACCGGCCACTGCTCAGGACCCTGGC 
TGACCCCACAGATGCGGAGTGCCTGAGATGGCTCATGGGTGACC 
CCCAGGCATCTGGCAAAGGTCACAATGGCTGTTTGGCTTGAAGA 
CAGCCCTTGCAAGATCTGTTTTGAGCCAACCTGTGGCATTTAGC 
CCTCCCTGGGTGACAAATAAAAAGGCTGAGGCTTGTA 




KCP_134 0 
45 


ATTGGAAGATAAGAATGCAGCCAAAGAGGTCTTCAGGGTAGCGT 
GTGGCCTGGGCGCTGAGACTATTGGGCCTAGCAACTTCTCAAGC 
AGTCTATTAACCACAGCCGGTAGCCAGCTTTTCCCCGCCCTTCT 
C C CAGGC ACACACAGCC ACCTC CATCACCAAAGGTCAGGCGAAC 
CACCTCCCATGGCTACCCCCAGCCTGACTTGCTTTATAGAAATC 
ATGGCATCTCATCCTCACAACAGCCCACACTCACAGTGAATCTT 
GGCCATTATGACAACTGGGGACACTGAGGCTCGGAGTGGTGGAA 
ATTCTCAGAATCACATAACAATAAGTGTTAAAGTCAGAATTTCA 
ACTTCATCTCTCTAACTCCAAAGGGCGTGTGTGTGTGTGTGCGT 
TTCTGGCCATAATCATATTGTGCCCTACAAGCCCCAGTGAGGAA 
TCTGCTAGGAACACTGGTTTGGGGAAAAAATGTAATAAAATATG 
TGATCCAGAAGGCGGC [C/T] TTGGTACCTGTCATAAACCGCAG 
CATGGGGTACTCACTATGCCTGGGGTCTGGGCTCTGAAGGCATG 
ATTGAATGATCTCACTGCAGGCCTGGTTGTCCTGCGAAGACACC 
CGTCAATACATGAATATTGACACACAACGCTGCAGTGCACGCGC 
TTCTGGCAGGGGAGCTGCTGCACTCGAGGGCAGCTCAAGGTTAA 
TTTGCAGGGTTCATGTTTGGAGTTTCTGAGCAAGTGTTGCAGCT 
TTGGCCCCCAGCCCCCTGAGGGGAGCTCTGGCCGTGCATGAGGG 
TCAGACAGAAAATCTCCTTTCCTCCATCCAGGCCTGCAGTCTGC 
AGCACTGAGGTCAGCGCTGGCCACAAGCCCACCCTGTGCCTCGT 
CAGCCCCACTGAGCCTCTCCATCTATCATGCCACAGGCTGACCC 
TGAAATGCAAAATCATTCTGTCCTCCCGCCCTCCACTCCCACCT 
CGCACATCTATGGATTTGCTGTTCAGAAAACATCTGT 


SEQID 
NO. 173 


KCP_13 55 
18 


AGTTAGATTGAGCTGGGTCCCATCTTGGGCCTTTGCTGGTCCCT 
CCCTAGAAAGTCTGCCCCCTCCCCCTGCAGGGTGGCATCAGCAT 
TCAGGCCTGGCCCTGACGCCCTCCTCTCTGGGCCACCTTCACCT 
CCACAACCCCGGCACCAGCACCCATCCCCACCACATCCCCAGCA 
CGCAGCATCTAGTAAGGGCACCAAATGCATGCCCAGACATATGA 
GTGAAATGAATTAACCCTGAACCTGAAAAAGGGCAACCACCACA 
CAAGATTCTCTAGAAACAATGTGAATTGTGCAGAAGGAAATTAA 
CCCTACTCCATCCAGCCCATCCTAAGGCACaCjGAL. 1 ± OvjACCTGT 
TCCTCTTGATGGGGCTGGGGCTGAGGCGGGCAAGGCAGGCAAGT 
GCTGAACAGTTGGCAACATTGCCCATCCCGTCTCCCTGCACCAG 
GCTGGGCCTGGGGTGAGGGGGTGGGGGCCGGGGTAGCTGGGCTC 
CTCCAGCAAAGAGCAG [G/T] ACTGAGTC CCTGGTGACTATTAG 
GTAAAAGGTCCCTGACAATTTTGAGGGGCCAGATGCCAACTCGA 
GGGATACAGAGAAGATCTAGGCACAGTCTTTCCCCACCATGTCA 
GACAAAAAGGTTAGATACAGGACCTGATATGTTATAAAACTCAA 


SEQID 
NO. 174 



BNSDOCID: <WO 20O4041 193A2J_> 



WO 2004/041193 



-145- 



PCT/US2003/034681 





TCAATATTTACTTAGTGAATAAATGGACGGATGGATGGATGGAT 
GCATTAGGCAGCCAAGTGGGCAGCACCGATGACTTAATGTACTG 
AGTGCTCCGACTCCAGCAACATGCATTCATTGTTCCTACTGTGT 
GCCAGTGAACAAGAGCAATGAACTCAATGACTTCTGCCCAGGGT 
GGGCCAGGGAACCAGGGAAGACTCTCCAAAAAGGCAGCATTTGG 
GCTGGGACGTACAGATGAGTAGGGGGTCGAGTGTGTCGTTATGT 
CGCTGGAGCC CAGAGGCGTC C ATCAGGACTTGGGGG AGGGCAGA 
TGAAAGGGCCTTACTGCCTAACTTGGAGCCACTGTAT 




KCP 13 60 
36 


CCCATCTTGGGCCTTTGCTGGTCCCTCCCTAGAAAGTCTGCCCC 
CTCCCCCTGCAGGGTGGCATCAGCATTCAGGCCTGGCCCTGACG 
CCCTCCTCTCTGGGCCACCTTCACCTCCACAACCCCGGCACCAG 
CACCCATCCC CAC CACATCC CCAGCACGCAGCATCTAGTAAGGG 
CACCAAATGCATGCCCAGACATATGAGTGAAATGAATTAACCCT 
GAACCTGAAAAAGGG CAACCACCACACAAGATTCTCTAGAAACA 
ATGTGAATTGTGCAGAAGGAAATTAACCCTACTCCATCCAGCCC 
ATCCTAAGGCAGGGACTTGGACCTGTTCCTCTTGATGGGGCTGG 
GGCTGAGGCGGGCAAGGCAGGCAAGTG CTGAACAGTTGG C AA.CA 
TTGCCCATCCCGTCTCCCTGCACCAGGCTGGGCCTGGGGTGAGG 
GGGTGGGGGCCGGGGTAGCTGGGCTCCTCCAGCAAAGAGCAGGA 
CTGAGTCCCTGGTGACTATTAGGTAAAAGGTCCCTGACAATTTT 
GAGGGGCCAGATGCCAACTCGAGGGATACAGAGAAGATCTAGGC 
ACAGTCTTTCCCCACCATGTCAGACAAAAAGGTTAGATACAGGA 
CCTGATATGTTATAAAACTCAATCAATATTTACTTAGTGAATAA 
ATGGACGGATGGATGGATGGATGCATTAGGCAGCCAAGTGGGCA 
GCACCGATGACTTAATGTACTGAGTGCTCCGACTCCAGCAACAT 
GCATTCATTGTTCCTACTGTGTGCCAGTGAACAAGAGCAATGAA 
CTCAATGACTTCTGCCCAGGGTGGGCCAGGGAACCAGGGAAGAC 
TCTCCAAAAAGGCAGCATTTGGGCTGGGACGTACAGATGAGTAG 
GGGGTCGAGTGTGTCGTTATGTCGCTGGAGCCCAGAGGCGTCCA 
TCAGGACTTGGGGGAGGGCAGATGAAAGGGCCTTACTGCCTAAC 
TTGGAGCCACTGTATGTTTCAAAACAAAGGAG [A/C] GAGAGGA 
TCCTGGGAAAGAGAAAGGGTACTCTAGGCAGAGGATGTGAATGG - 
GCACAGCACAGGTGAGAACATCAAGACCAGGGGTCAGGGAATCT 
ACTGGTAAAC AATTGTAC CC CAAGGGAGCAATCACAGCCTCTC C 
ATCCACAGGGAAATGCCTGGTGGGGAGGAATGGGAGGAAAGAAA 
CAGATTGCATGACTGTGTCTTGAAGGTCTAATTCCAGAGTACAG 
CATCACCCCTATCTTCCAGGTCCAGAAACTGAGGCTCAGAGGGA 
GACTTTCTGATGAGTGCAGCGTGCAGATAAGAGCATCTCCAAA.G 
CTACCTCCTTCCCCAGTCACACCAGGGCATAAGCAACTGATAAC 
AGCTGTCAGCACGGGACAGTGGAGGGAACACTAGGTTAGGAATA 
AGGGTACGAGGCTTGAGTACAGATTGTCAATGACTCAGTGTGTG 
AACTTGGTCAGGTGACTCCAACCAGATGACTTCCTTCTCTGAGC 
TTCTGTTCCCTCCTCTATGAATGGGGACAATCACTCAGCTTCAC 
AAAACAATGGCTGCGAAATTGCCTGGTACAAGAGAGAGAACTTC 
CAGTGTGTAGGGGCTGTTGTCCTAACTGCCCAGCCCCCTAGATA 
GGTAGTTATGTCATCTGTGAAATGGGTGTTAGAATTCCTACCTC 
CCAGGACAGCTGTGGGCAGAAAACCAAAGAATGTGTGTGAGAGC 
CCAAGCACCATGCCTGGCACATAGTAGGTGCTCAGGAA^GGCTG 
AGGGTGCAGCTGCTGTCCACACACATGGTACCACTGCCCCAGGA 
. AGGGGCTTCAGGAACCAA.GAG CAATTCTGAG CACTGGTGACTG G 
ACTCTGCCATTCTCCATTTCAAACGCTTTTTGAAAGCAGCTCCA 
GAC CCAAGCAGGAGAGCAGGAGG CAAAAGAAACGCAGGG GCTTT 
CCCGAATGGAATTTTAGAAACACACAGAATTGTCTCCTGCACAG 
AAGGGAAGCTGTCTTCCACAGCACA 


SEQID 
NO* 175 


KCP 1376 
60 


CACTGGAGCTGAGACTCCCAGGTCCCCTAGGGCTTCTCTCCCAG 
GGGCCTCTGGGCTCCCCAAGGCCACGTGCTGCCCCGACTAGAGA 
CCTGGGCCAGTCCTGACCAGGGGAAAGAGTAGCGCCGACAACAG 


SEQID 
NO. 176 



INSDOCID: <WO 2004041 1 93A2J_> 



WO 2004/041193 



-146- 



PCT/US2003/034681 





CCCCAGATGGTATGTGCACTGGGACATACTGGCAGCTGCCTTCA 
TGACAGCAAGCCATAGGTCCAAATC CCGC C CCTTCACAGGGACA 
TTCCCAACTGGTCAGGGGTGGACCTCCCCTTCCCGGCTGTCTTT 
GGTGTCCAGGACGATTTGCCACAGACAGGGGGAGCTAAAGGGGC 
CCACGCTTGAGGCCGCTCAGCTCTGAGTCCTCGCCGGCCACAGA 
GGACCTTCGTGCCTGTCCTCTGTCCTCCTGCCCAGTCCCCAGGC 
CAGGCTCAGCTGGAGTTGGGGAGCAGAAAAACACGCATCTGAAT 
CAAGGCTCTCGGAGCCTTTGCTTCTGCCTCCAAGAGGCGAGGGA 
AAATGAATACCCAGGC [A/G] AGCGAGCAAGAGAGACCCTCAGA 
AAACCCCAGATGCCCCTGGAATCAAGCCCTGTCCCACCAACGCC 
ACGTGGATTGACAGGCTATTAGTCTTCCTGTAATTAGGATTCTC 
GCCTCAAATCTTGTATCTTTTTCCCCCAGAAGATTCTCCTCCAG 
CCTTCACCACTGCCCCCTGGCGCTTCCTTGCAAGGCTTTTGAAG 
AATCCTTTGCAGAGAAGCAGCCTCCTTTGGCAGGGGCTGCAGAG 
CACTCTGCCTCCCTAGGCCAGGGCGAACCAACAGAGGCGGGAGA 
TGAGGAGGAGCAGCGCGGCTCTGCTGCGTGGCCCTGGGCAAGCA 
CCACAACCTCTCTGGGCCGTTTGCACATTCTTACCGCCAGGGAT 
GTGGGCGGTAAATGAAAGAGACCAGCACAAACCAGTGTCAGCTC 
CCTTCCTCGATTCCTAAAATGTGATGCCCAAAGATGGGCCAGCC 
TCCTGCTGTGCCTTCTCTGGGGGGACATTTAATAAGT 




KCP_14 3 6 
12 


TGGCCGCCTCTTCCAGATAACAACTCTCCTCTCCTTCCCTGCCC 
TCCTGCTCCTCCTGTTCGCGCTACATAACAGACTCTGTGGGGCC 
TTGGTTTATGTATTTCCTTCTCTCCCCTACTGAAATACATGTGA 
GCGATGCTGGGGCAGGCCGACTAGAAGAAGCAGACTATCTGCTT 
CTTCTCCACCCTTAGAATGGTGCTGGGCCCAGAAGAGGCATGCA 
GTCGATATTTGCTGAATAAATGAATGTCAGATAAAGTGGTGTGG 
GGACTCC AGGGGAAAGATTTGTCATTCTC C ACC CTC CCAGTTCA 
GCTTAAAGCAGAGAAGTGAGAGGTGCCCAAAAAGGGGTGTGTCT 
GGGGGGTGGGGGGTGGGGATGTTCCAAGATCTCCAAGGCCTGGA 
TTTTAAGCAAGGTTTGAGATGCCAGCAAGAGGGCCTGGCATTGC 
CAGATTGATAGTCTGCATTTCAGAGAAGGACAACCCCACCTCTG 
ACCTTAGCCC [A/G] AGCCTCAACAGCCTGCTCAAGGAGATCCA 
CCCTTAGTAGGAGGAGGCAGCCAGGCCAGGTTCCAGTCCCTGCC 
ACCGCTTGCCAGGTGTGTCTTGGGCAGCAGTTGCCTTTGCTCGG 
TGGTCTTCAGCTTTGCCCCCTGCCAGGCACGTGCTGGCCTCCTG^ 
CCTGCATCGTAGCTCATGGAGTCCTCTCAGTCACCTCTGTATGC 
CCTGCAGCATCCCCAGTTCTCAGTGAGAAGAGTGTGCTCTGAAA 
GTTAAGTAACTTACCCAAGGTCACACAAGGTCTGAGTCTCAAAT 
GCATACAATTTGACCCCATAGTCTAAGGTCTTGACCGCAATGGA 
ATAAGAAATTATTTTACCATTCTGAGTGGCAGTCTCTGAAGACT 
ACAGCAATAATTGATGCCTCTCAGGGGGATAGGTGTGTCACTTA 
CAGGTGATAGTGAGGTTGTCCTCAGCCTCCCTGCTCTTCGTTAG 
ACCTCCCTCCTCCTCTCTACCCGGGCCAAGCGT 


SEQID 
NO. 177 


KCP_JL449 
60 


GCGGAACACCTCTGCCGCACCTGCAGCAGCCTTGCTCTATTTCT 
TCACAAGCTTCCCCATGACACTGACCCAAGGCTGTCTGGCCACT 
ACAGCTGCTGATGATGATTAGCAATAATAATAATAATAAACGAA 
ATGCCTTCTGCTTAGATCATCTTTAATTTCCCCTCCAGAATGAC 
ATTCGACTCTGCTTAGAGTTACAGGCAGCCCAGCAATTACTGAG 
CGCAAATACCGTGTTCACCCGCCTCACCTCATCCACGCCCCCAC 
AACACCCAGCCCTGAGACTGGCTCCACGATCACCTCCACTTTAT 
AAAATAAGATATCAAACTCTGAACAGAACGGACGTCTCAAAAAA 
TGGGCATATTACATTTAAACCCTCAATCTGTTGGGTATTTGAGT 
GAAATGGACATACCTGCAGGGAGTCGGTGGCGAGGGCCGGCTCT 
GAGGACTTCCTGGGTTGGGATCCTGGCTCTGCAGGACTGCGTGA 
CCTTGGTGAGTTACTT [C/T] ATCCCTCCAAACGCGCTGTTCTC 
CTTCATAGAATGGAGATGACCACAGGGCCAGATTCATAAGGTTG 
TTCCTTGTAATACAGGTGAATATCCATACCCAGCAACTGCTGGA 


SEQID 
NO. 178 



BNSOOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



147- 



PCT/US2003/034681 





CCACCTGTGGTTTCAAGGATAATTTCCCTCCCACGTCCCCGTG^ 
CCCTTGGAACCTTCCTCTCCTCCTGTCTCCCCCTGCCCCCATCA 
CTTTGTAATTGAAAAGTCATGATTGCTCTCCCAGGTGTAGCACT 
GCTCACAGGTCAGATTGCCTGCTCTGACGTAGTGACTCAGTTGG 
ATGCGGTTCAGCTGTGTATGATCAACTCCCTCCCCCTGACAAAA 
ACATTATTTTG CATCACAGAGAAGT^GATTTC TTT CACAC ATAA 
AAGAAGGCAAAAAGTGGTGCCTAAAGGGCTGGTAC2AGCAGCTTC 
AAGAAATCAGGAAGAACCTGGGCTCCTTCTGCCTTCTTGTTCTG 
CCAATATCACCCCATGGCTGCCACTTCATGGCCCAAG 




KCP_14 67 
46 


TTGTGAGTAGGGCACGCAGGGAAGAAACCTGTTCAACCCAGCCC 
CGTGCTAGAAAGACATCAGCAGGGCCTGCAAAAGCCCTGATTAA 
ATCTCACAAGTTTGCACCTGGAGCCGCCATCTTGAATTGCAGGT 
GAATATCAGCCTTTGGTTTGGGCTGTGTGCCCCAGATGATGGTG 
GTCCCAAATTACATAGGCCAATATCCAGAGCTGGGTTAAAATGA 
AGCATTTCGAGGAAAAAAATGCAATGAAATTTGTTTAACCGGTA 
CTTCAGGCTTTTGAGCACAGAACAGCGTCCATCCCTCCAAACAC 
ACACTGAGGATATACACTTAGCCAGGAGGGAACATAAGGAGGGG 
TGGACAAGCCATGTTTACTAAAATCTCTCAGTGTGTGCCAGGCA 
TGTTCATGTATATTCAGGAAGAAGTGTCAGTATTTAAGATCCTC 
GGCCCTTGCCCGAGTCCCCAACACGCCTTCTTGTCTGGAGAACT 
GTAAATCTTGGAAACATCTTGCAAGGGGGGACACCTCACAGAAG 
GCAGGCTTGGCATGGGATAAACAGAATCGACTCCTCTGCTTCCT 
TCTGATGCACAGTGAATGGGCAGGTGGAAGCATCGTTGCTTAAA 
GAGGAACCAAAACTCCACCCCAGAGCTGCTAATTCCTTTTGGCT 
TGCAGTTATGCAGAGGGCTAAAAAATCCAACGAATCACAAATCC 
CCTGGTTGCTAAGTAGAAAGAATATGTTTTGGCTGCTGCTGTTC 
CCTTCCCCAAGGAAAAGATTCAAGCAGAGGCGGTCCCCACCTCT 
CAACACAGAAAGCAACATCTCTGATTGCCTCTAGACACACCTTC 
ATGCTCGTGGCACTTTGGGACCCTCTGCCCGCTGGCTTATGGGC 
ATGGCTTCCCCATCACTCTGGGTCCTTGGGAAGAGCCTCTTTCC 
CAGACCCCACCTCTGTGCCTCATCACATTTCTCCCAGGCTATTG 
ACTTGTTCAAGGTTAAGGTATGAAGAGAGTCA [C/T] GCAGCAG 
CGCTACCTGGCTCTGCTCTGCTGGGGGAAGCCTTTTCAGAGCCT 
GCCTCTTCCTCAGCATGAGGGGCTGCTCGGGCCCAGTCCCAGAG 
GCCATGCTGGTCCCAGGGGAAGGTGGCCGTCATCCCCATCTGTG 
TTTTCTCTTGCAGGTAAGTCATGCTCCAGCAGTCGGGAGGGTTG 
TGTGATGACACACTTGGCAGTTTGGGAGCAAAAGCCGCCACAGT 
AAGACACAATTGATTCATTGC CTCTCAAC CCTCTGCTGGGGTGG 
ACTTTCATGCGTGGACTTCTGTCCCCAAAGAGGCTTCTCTGGGT 
CTGGAAAGGGCCCTAGCCTTGGTTGGGGGAGGCAAAGGGGTGGC 
GGCTTCCAGGTACCATCTGGCCAGGAACCGGCTCCATTGTCTGT 
GCATGTAGCTTGCACTGGGCTGCCTGCTCCAAGGGAGGCATCTC 
CCCACGATCTACGACATTGGCTTCAAAGAGCTGCTCCTGGCAGC 
TTCGAATGGCTGAGACCTACTGGCATGGGATGGAGGAGTGCAGG 
GAGCTTCCCGGGACCTCGCTAGTCCTGCCTGGATGCTCAGAAGG 
CCCTCGTCCTCGGTGGCATGCAGCCTCGGCCATTTCCAAACTCA 
CGGCATCTCACCCAGCCATGTCACCCACCCCCGGCTCTGTCGCC 
CTTCCCATCACCTTTCTCCCACCCATCACCTCACATCAAGGTTT 
CAGCCAGCGGGAACCAGGTTTAGACTCCAATTACCTGTGCGTGT 

CGAC ATTTGATTTCTGGT ACTGAAATGTCAAAGGGTCCTGAGG C 
^CCCGCTAGGGCAGCACGCGGAGCATCCACCTGCGTGCGCATCC 
TGGG CTCTCTCTGGGC CACTTGGTGCTGGGG ACATGC CGGGAG C 
TGGTGGTCAGCCCTCCTCCTGCCTCCTCAGTGCTGCATCTTCAC 
CTTCTGCAGCTGCCTACCAGAAGCA 


SEQID 
NO. 179 


KCP 1492 
16 


ACAC CTTGACTTTAGCCCAGTG CAACTGACTC CAC ATTTCTGG C 
TCCAGT^ACTGTAAGAGAATACATTTGTGTTTTGTTAAGCTAGCA 


SEQID 
NO. 180 



JNSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-148- 



PCT/US2003/034681 





AATTTGCAGTAATTTATGACAGCGCTATGAGAAACCAAAACACC 

AGGATTATGCCCCAAGGATCCTGATGCCCTCCCTCCTCTCTGCT 

CTGCAGTGTGCTGGAGCTCACAGGGCTCTGCTGCTGGGAGTTAG 

TATCTAGTCCAACACTTTACCCACTCACCCCCCAAGCTAAGGGA 

CTCCTGAAATCAGGGACCAGATGCATAATAGGTGCCCAGGAAGT 

GAGACTCG CCTTC CCCAGATTAAGAATAAAGAAGACAAACT ATC 

CACGGCTGCTGTGAGCCTCTCATCAGACCTCAGCTTCTAGGGCA 

GGGTCCCTGCCTGTCTCCAGTATGTGGCCTCTGTGTCTTCTTCG 

CCCTCCATCCCCACAGTGGGACGAGAAGTCATCAGGAAGGCAGG 

GGATCTGCAGGCAGCC [A/G] TCAGGGCTCTAATTGCAGCTGGC 

TGGGGGACCATGGGTCAGGGCTGCCACCCCCTGGCTCTGTGCCT 

TCACCTGTGTAACGAATGGGGCACTCACAGCCCCTCTCAAGTGG 

TCCTGGGGATGAAGTGAGAAGGTGACATATACAAGTGAGTTATA 

CACGTTCCTGTTCTGTCACTCACCAGTGCTCACTGGGTGGGTCA 

CTGAACTCCCCTCAGCGTTTCCTTCTCCATCTGTAAACCACCAG 

TGCAAACCTTTCCCAGATAGTGCTGACCCGAAGCAGGAACCAGT 

GCCCCTCTGCCCTCAGTAAGTCTGCCAGCAGAGGAAGCCCATAG 

AGGGTCTTGGGAAATGAAGCCAACAGAGTCAAGAGGGTCAGATG . 

ATGAGGGACTTCAAGTGCCACCTTCATCCCATTCTTTCTGCAAA 

TATTCACCACACACCTACGTGACCTCAGGCTCTGTGTCAGGTCC 

TGGGGATGTAATGGTGTCCATGAAGAAACAAGGTCCC 




KCP 1495 
35 


TCCCCAGATTAAGAATAAAGAAGACAAACTATCCACGGCTGCTG 

TGAGCCTCTCATCAGACCTCAGCTTCTAGGGCAGGGTCCCTGCC 

TGTCTCCAGTATGTGGCCTCTGTGTCTTCTTCGCCCTCCATCCC 

CACAGTGGGACGAGAAGTCATCAGGAAGGCAGGGGATCTGCAGG 

CAGCCATCAGGGCTCTAATTGCAGCTGGCTGGGGGACCATGGGT 

CAGGGCTGCCACCCCCTGGCTCTGTGCCTTCACCTGTGTAACGA 

ATGGGGCACTCACAGCCCCTCTCAAGTGGTCCTGGGGATGAAGT 

GAGAAGGTGACATATACAAGTGAGTTATACACGTTCCTGTTCTG 

TCACTCACCAGTGCTCACTGGGTGGGTCACTGAACTCCCCTCAG 

CGTTTCCTTCTCCATCTGTAAACCACCAGTGCAAACCTTTCCCA 

GATAGTGCTGACCCGAAGCAGGAACCAGTGCCCCTCTGCCCTCA 

GTAAGTCTGC CAG CAG [A/G] GGAAGCCCATAGAGGGTCTTGGG , 

AAATGAAGCCAACAGAGTCAAGAGGGTGAGATGATGAGGGACTT 

CAAGTGCCACCTTCATCCCATTCTTTCTGCAAATATTCACCACA 

CACCTACGTGACCTCAGGCTCTGTGTCAGGTCCTGGGGATGTAA 

TGGTGTCCATGAAGAAACAAGGTCCCTGCCCTCATAGAGTGGCC 

TGAC ATATGCCCGAGGCAGTCAGCAG CCG AGTGCGGGAG ACT CT 

TGAGCAGAGATTGAGTGTGTTGATATCTGTAGGCATCAGCCTGG 

CTTTGCTGAGTGAGCTATATCAGAGTGGAGGAGGCCAGAGGCAA 

AGTCCAGACTCCACTGGATCCTGGATTGAGGGGAGAAGGGGCTG 

GGCGGAGGAGCAGCCTGAGCACCTGCATCTCACTCCAACTGGGT 

GCTGATTTGTCCCCATGGCCCCAGCACCCAGGCAGGTCACCAAG 

TAAGCTCAAGACAAAAATGATGAGTGACTCAACAGTG 


SEQID 
NO. 181 


KCP_15 67 
32 


ATAAATTGGATTTCATCAAAAATTTAAACTTCTGCTCCAAAAGA 
CAC T CTTAACAAAG GGAAAAAG CAAG C CACAATATGAGAGG AAA 
TATTTGCAAAGCATCTGATAAAACATGTGGATCTAAAATATGCA 
AGGAGAATAACAACTCTATTTTCCACTAAGGAATGAATGACTGT 
ACAAGGACCACATTCTAATTAGGAGCTTCTGAACCCAAAGGAAT 

AGTAGGGCTTGATCTCTGCCTCTGAAGGGCAGAGGGCGTGGACT 
ATTCTTGGCTCTTAGGGGACAGCTAGAGAAATGTGGGTCTCATG 
GCGACAACTCTGGACTCCATTGGAAGAACCTTCTAACAGTCAGG 
GCTCCCAGAGATAAACTAGACAAGTCACCAAGAGAGGCAGTGGG 
TACCCCTCACAGGAGGGGTGCAAATCAAAGCCAAGGCTTGGAGT 
GGACCATATTAAATCC [A/T] TTTCTTATCCTGTGATTCTTAGA 
GTCCTATCTGTATCAGGGGAAGGCAGGTGGGTTCTAGAACTTTC 


SEQID 
NO. 182 



WSDOCID: <WO 2004041 1 93A2_1_> 



WO 2004/041193 



-149- 



PCT/US2003/034681 





TAAATGTGTCCCTGTGGGTTTTTCCTTCTCCAGCTACACACAAA 
CTTGGGCCTAATAAGAAGTCTATGGCATTAACCCAGCAGGAATG 
CTTAATGCTTATATCTGACCTCAAACCAAGACTGTCTCCACAGT 
GAACAACCCCGTCCTGTCCCCTGGGCGTCTCCTTAGCAAATGCC 
ATCAGTCAATGGTGCAGCCATCTTGGAGCCCTTGCCATCTATAA 
TCTTCTACCGCCACCCCCCCAGCTGATTGTTTTCTTTGTATGTC 
TCCTTCCTGGACATTACTTATTCTTTACTTTTAAATATTTGCTT 
CCGTAAAAAAACAAATGAATGCCTCGGACAGATTTATAAAGAAC 
ATTCCTGGAGAGGCGGGTGGATTAATTATTCAGCATCCTCTCCC 
TTTGTAACTATTTATTGTCTCATATGCATTTATATGG 




KCP_1586 
17 


TTGCCCAAGTGATGTTCCATGTCAGGCTCTAGGGTCCCTGCAGG 
GACAGAGAGGGACTAACATTTACTTACATGCCTATAGTATGTCA 
GGCATATACTTGTGCCTTTATATATATCAGCTCTGTTTTTGTCA 
TTAAAACATCCCTGTAGAAAGATAGGCACTGCTGTCCCATTTTA 
CAGATGGGGAAAGCCAAGCTCTGAGTGGTTCAGCAAACCCTGGG 
TGCATAC CC CCACCTTGC CCCTGCAAAACCAACAAAAAAACGAA 
GGCCCTGCCTTCCTGGAGCTGACATTTAGGTTGATTCTGAAAGT 
CAGTAGGCCCAGATTTTCACTCTTCATTTTTCTTGTTTGGAATG 
AGAGAGCACACAGCTGGGTCGGGGGAAGGAGCGAGGGTCTAGGC 
CTGCATCCACTCACCCCAAAGGAAAGGAGTAGGGGACCAGTCTG 
CTGGACATGCAGACAGCGATTGGAGAAAAGTCAGCCCAGCTATG 
AACCCCATTCCTTTCAGTA [C/T] GAGCCAAGAGGGATGGCATC 
TGTCAGAGTTGCTGGATTTGGGATTTTGCATCTTGCCAAGTGTC 
CATGAGGAATTGGGGAAACTCTCCCCCTGGCTGGACTGAGGCTT 
CAGCAAGCATTGTTGCTGCCCAGTGGTGATCAGCTCAGTGTCCT 
TGGAAAAGAGCAGAAAGTGGTATCACGAACATATCTTCTCCTTT 
GCTTCCTTCTCCTCACTCTTGATGATCAT^ 

CAAATATGGATCTGTGAGGCTACCTCTGGGGTTGAAACTTGGTT 
TTGGGCAAAATTTGTGATGTTCTCTCTGCCCAATCCAGCCTCAG 
GCTACAAATGAATGTAAAAATCTCTAATTTAGTGC CAAGTAACA 
GAAAACAGCTCTACTTATCTTAAGCCAAAAAGAGGGACTTCTCA 
GAGGCATACTAATGGAGGATGGCAAGAGGGCCTCACGTGGAA 


SEQJD 
NO. 183 


KCPjL601 
45 


GCTCTTCTGCTGTGGAGGATCCATGCCATTGACCTAGGCACCCG. 

TTTTCCACATATTGAGCATTGCTGAGCACCTATTCTGTGCCAGG 

CACTGTGCTTCAGGGCCATGGGGGATGCTCCAAGCGGTAAAATG 

CAACCAAAG CCCCGAAGGAGCTCACATTCTAGTCATGTC CACAA 

AGAGGTAATAAATCCATAAATTGTATGTACTATTCTAGTCACAA 

TAAAATTGTGTCGTACTGTAATGCTGGGTATCCATTTTAAAACG 

GGGGG CATCGGCTG AATCTGGGTCATTACAGT AGG AAATG CAT A 

TATATAATCATTTACTCATGAATATTAATGTATTTAATGAGGGT . 

AAAAGATATTACTTAAAGCAAAGTATTCGTTCCAGCTACTGTTG 

GATTTGTTCATTACTGTTTCCCATGCAGATATTACCTGTGATTT 

ACCTGCATATCAAGCATCTGGAAGTAGCTCAAATCCACCTGTGG 

GTAAATTAGGTTAGCC [A/G] TTTGTTGGCAAAAATTACAGTGT 

TAACTAATTTCCAGGGTATGCTTGCAGTCAGTAGTTTCATACTT 

AGGTACATGACTTGCATTCACATCATCTGGTTAATGGTGTGAAC 

AGAGATTTTCTTTATGGTTTTTGGAATACAGTAAGATAATGTTA 

AGCTAACGTAAGTCTGTTAACAGTACCTGGTTCTGAACTGTATT 

TATAAGGTGT ATCATAAAAC CATTACTTTGGAGTTTG C CAAT CT 

TAAATTCAGAACAATTCAAAAATGAGCCAGAATCTAGTTTGCAT 

CATTACCACTTATAAAAATAAGGATCTGTAAGTTGGCTGGATAA 

AATATATTACAAAATAATGACTTAAGTGGCTCTGGAGCCAGCAC 

AAAAGATAAAAATTGGGTATACTCAAAATTAC CTTCAAAATATC 

TTAAGTCATTCTTAAAATACATGTAAATATGCCAACTCAAAATA 

CATC CAACAAAACTAATATTTTTCCCAATTTGTTGGA 


SEQID 
NO. 184 

• 


KCP 164 8 


TCGACGTTTCCAAAGTCATGGGGCCTATGGTTTGTGAGCTTATT 


SEQID 



3NSDOCID: <WO 2004041 1 93A2_L> 



WO 2004/041193 



-150- 



PCT7US2003/034681 



97 


TAGGTTGTCCCCGGGCCCAGCATCAAAAGCATTGAGACACGTAC 

TGAGGGACTCTTTTCCTAGCCTCTCAGTCCTGACTGCTCAAGGA 

CCAAGTGGTACTTCTTGCCTGCGTTCCTTTAATGCTTGCCTAAT 

ATGAGCTAGTCTTCTCTGATCACTTTTTTTTTTAATCCAAAGTA 

GGTGGGCATTGTCCCAAGAGCCTTTGGAAAGCAGCTGCCTCTCA 

CTAGGACTTCACAGCATCATTTTGCTTTGCTCTCTTTGTGGTTA 

AAATTACCTTCCATTCGTGGTGGGTGTATGTCAGGATCCCCACA 

AGAAACAGAGGGACACCCAAATTAGGGACATACTTCAGAGGGAC " 

TAATGACAAAGGCATGGGTGGGAGTAGAGGGGAATACAAGGGAG 

ACTTCAAGAATCTTGGCCTTTATTATAAATGCAATGTATGTCCA 

CTATGGAAAATTTGGG [A/G] AAAAAAGCAAACTAGAAAGAAGA 

AAAACCACATTGCCTGAATTCCTACTGCATGGAGAGAAGCATCA 

TAAACACCTTTTGGAGGAGTCTCTTCTTCCTTTCTCCCTTTCTC 

CTTCTTTGTATAGAGAGGTCTTTCCTGAGGACTTCCCAGAATCT 

TGCAGATCCAAAATCTTAAGAATTTGCAGAGGCAGTGAGGAGTT 

AACATG CAC AGCTGAGGGAAT ATTCTGCTTTTT ATCTGGAAC CA 

GGCTCGGAACAAGACTCCTTGCTTTTCTGTCCTGTGTTTTCATC 

TTCTCTCAGAACCCTAACTTTGAGATAAGATCTTTGACTATTAT 

TAGGCGGGTGCAAAAGTAATTGTAGTTTTTGGCATTATTTTTAA 

TAGAACTGCTTCTGTGTCCTCAGATCTCCATCGTTCATCTCCTG 

ATAAGTCCCTGAAAATTTCCTGGCCCCTTGGAGCTCCTTCCAGG 

AGTAGAATGATCACAAGAGCTGCCATGTATTGCTTAT 


NO. 185 


KCP 1692 
34 


TTGCTTATTCCAACTTGGACTTGCCGGAGTCCCATAGACAGAGG 
CTACTCTCCCACCGTGCTGAAGCTGGTGCATGCCATGTTTCAGT 
AAGAGAAAGGAGGGTGCCTGGGCTTCGTCTCCACCCAGGTGCCT 
CTCCCCCAGCAGCTGCACCAGGCCAGCTGAGGGGGATTTTAGCC 
CGAATCCAGGGTTTCTCCTACAGAAGACAAGGAGTTTGGGCACT 
GCCAGAATTAGAAGAACAGAAAGAAAATGTTCTGGATTTCTCAT 
CAAATGCCCCTAGCCTGAGAAATATAACTAAATTCACCCTTAGG 
TCATCTTACAATCTGTCCTGCCCCAGTGTTCCCCACTCAGGGAA 
CTGCTCCACCCACATCCTGGTCCCCAAACCAGAGGCCTGGGAGT 
CACCCTTGACATTTCCTCCCCTACACCCCTAATGAATCAAATCC 
TGTTTATCCTGCCCTCTGAGAGTCTGCACCGAAATCTCTCTCCT 
CTCCTTCCCACCTACC [A/G] TGGCCCAGCAGCTTTACCATCAT 
GTCTGACGGATCTCTGCACCGCTCCCAACTGGCCTGTGCGTTCA 
CTCCTGCCCCCTCCTCCAGCCCTGTGTACACTCCCTTCCACCAT 
CCTTTCTATACTCTCCTCAATCCTATCTGCACCCTCCTTCAACC 
CTGTCTGTACTCTCCTCCAATCCTGTCCACACACTCCAAACAAA 
GTCATATTTCCAAGACAAATTTGACCATGCCACTTTCTCCCACA 
GCTCTCCACCACCTCCAGGATCCCATCCTCAGTGTTAGCCAGAC 
ACTCCCAAGGCCTTGTGATCTGCCCTGCCTATGTCTCCAGCCTC 
ATCCTGCAACTCCCCCTACACTCTGTGTTCTGGCCATCAAACCA 
ATGGGCTCCTCTTCCTGCACCCCCATCCACTCTTGCACATCCTG 
CACTCTATGTCTGAACAGCTCGGTTTCTCTTCTTTTCTTCTGGC 
ACATGTCTGCTCTACCTGCAGGTCATCTTAGATGTCA 


SEQID 
NO. 186 


KCP_1738 
48 


AAAAGGGAATTTATTGGCTCATGTAACTGAACTTCAACATTTTA 
CAGAATCTCATTGGCTCCAATGGGCTCATACGTCCATCCCCAAA 
CCAATCACAGTGACTGAGGGATTATCCAAGGATCACACTGGCCA 
CTTTCACAGGTTTTATCCCTAAAGGAAATCACAGGTAATAGATG 
TGGGG CTG CAG AAATG CAACATG CACTTTT C CTTGAAACTG CAT 
CCCTTTTCCCTGAAGATGAAGCTTGAAAGAACTCTAAGAGGTTA 
AGCATGGAGCTGATGGGCAAGCCACAGGCAGAAAGAGTAGCTGT 
GCAGCCAGGCTCCTGGCCAGGGAGGGCAGATAAGGAGGGGAGGC 
AAAGTTTGGTAAACAGGAAGCTAATCTATGGGCAAGAATCATTT 
TCTTCAGCATCCTGACCTCTCCTAAAATGTTCTCCACTGGTCCC 
TGCTAGGACAAAGGAATTACCACCAGACTAGAGTCAGGAGTCCT 
GGGCTGGTTCTGCTGT [A/G] TGACACAGGACAGGTGGCTTGCC 


SEQID 
NO. 187 



NSDOCtD: <WO 2004041 193A2_I_> 



WO 2004/041193 



-151- 



PCT/US2003/034681 





TGGTCTGGGCCACAGCCTCCTCCCCTGTTGATGAGCATGTTGGT 
TGTTCCAGCACCATGTCAGCCCTAGAAATCTCTGAATTCTTGAC 
CAGATCAGTAATTGCTCTCTTGCGTTTACTTTTCCTTCAAATAA 
AGAGATTGGCATACAGGGGAGGAGCCCAGTACAGACGGCATGCT 
TGGCTCAGGTTCCAGAACCCAGAAACCAGACAAGAGTTGGGAAA 
CCATGATGGTGGAGGAGGGTGTGCCACTCCTTACTAGTGCCTAA 
TCTCTTCGAGACACTAATGTTTCAGTATTATCCACAGATTCTGA 
TGCCAGGCAGC CCAGATG ACTGGGGTCAGTTATTAG CATGCTTC 
CTGGAGGTGGTTCCCAGGTGCAGGCTACCTGCAGTCTGGCTGGA 
TGGGCGCTGCACCACACTTGCTTCTGGGAAGCTGGTTTTGGGGT 
TGCCACAATCTCTGAAAGAATCACTAGGCCACCCTCT 




KCP_JL739 
82 


TTCACAGGTTTTATCCCTAAAGGAAATCACAGGTAATAGATGTG 
GGGCTGCAGAAATGCAACATGCACTTTTCCTTGAAACTGCATCC 
CTTTTCCCTGAAGATGAAGCTTGAAAGAACTCTAAGAGGTTAAG 
CATGGAGCTGATGGGCAAG C CACAGGCAGAAAGAGTAGCTGTGC 
AGCCAGGCTCCTGGCCAGGGAGGGCAGATAAGGAGGGGAGGCAA 
AGTTTGGTAAACAGGAAGCTAATCTATGGGCAAGAATCATTTTC 
TTCAGCATCCTGACCTCTCCTAAAATGTTCTCCACTGGTCCCTG 
CTAGGACAAAGGAATTAC CAC CAGACTAGAGTCAGGAGTCCTGG 
GCTGGTTCTGCTGTATGACACAGGACAGGTGGCTTGCCTGGTCT 
GGGCCACAGCCTCCTCCCCTGTTGATGAGCATGTTGGTTGTTCC 
AGCACCATGTCAGCCCTAGAAATCTCTGAATTCTTGACCAGATC 
AGTAATTGCTCTCTTG [A/C] GTTTACTTTTCCTTCAAATAAAG 
AGATTGGCATACAGGGGAGGAGCCCAGTACAGACGGCATGCTTG 
GCTCAGGTTCCAGAACCCAGAAACCAGACAAGAGTTGGGAAACC 
ATGATGGTGG AGGAGGGTGTGC C ACTC CTTACTAGTGCCTAATC 
TCTTCGAGACACTAATGTTTCAGTATTATCCACAGATTCTGATG 
CCAGGCAGCCCAGATGACTGGGGTCAGTTATTAGCATGCTTCCT 
GGAGGTGGTTCCCAGGTGCAGGCTACCTGCAGTCTGGCTGGATG 
GGCCCTGCACCACACTTGCTTCTGGGAAGCTGGTTTTGGGGTTG 
CCACAATCTCTGAAAGAATCACTAGGCCACCCTCTGAGTGGGTC 
CTTCTGTAGGAATTATGGATAAAATTGTTCCACTAGTCTTACCT 
TCTTGGGGAACCCTTCCTGGATTCCCAGGCTGGGCTGGGTGTCC 
CTGCAGCCTAGCCCCACAGCCCTCCTGCTTCTCTTTC 


SEQID 
NO. 188 


KCP_1742 
43 


TGACCTCTCCTAAAATGTTCTCCACTGGTCCCTGCTAGGACAAA 
GGAATTAC CACCAGACTAGAGTCAGGAGTC CTGGGCTGGTTCTG 
CTGTATGACACAGGACAGGTGGCTTGCCTGGTCTGGGCCACAGC 
CTCCTCCCCTGTTGATGAGCATGTTGGTTGTTCCAGCACCATGT 
CAGC CCTAGAAATCTCTGAATTCTTG AC CAGATCAGTAATTGCT 
CTCTTGCGTTTACTTTTCCTTCAAATAAAGAGATTGGCATACAG 
GGGAGGAGC C C AGTACAG ACGGCATGCTTGGCTCAGGTTC CAGA 
ACCCAGAAACCAGACAAGAGTTGGGAAACCATGATGGTGGAGGA 
GGGTGTGCCACTCCTTACTAGTGCCTAATCTCTTCGAGACACTA 
ATGTTTCAGTATTATCCACAGATTCTGATGCCAGGCAGCCCAGA 
TGACTGGGGTCAGTTATTAGCATGCTTCCTGGAGGTGGTTCCCA 
GGT [A/G] CAGGCTACCTGCAGTCTGGCTGGATGGGCCCTGCAC 
CACACTTGCTTCTGGGAAGCTGGTTTTGGGGTTGCCACAATCTC 
TGAAAGAATCACTAGGCCACCCTCTGAGTGGGTCCTTCTGTAGG 
AATTATGGATAAAATTGTTCCACTAGTCTTACCTTCTTGGGGAA 
CCCTTCCTGGATTCCCAGGCTGGGCTGGGTGTCCCTGCAGCCTA 
GCCCCACAGCCCTCCTGCTTCTCTTTCTCATCACAGTCTTGTTA 
TCTCTACCAACTGTAGGCCTGCCCCACTGATGGTGTGAATAAAG 
GGACTGGGTCTCTCTAGCACCTAGCATAGATCTGATACATAGTG 
GGTGATCTCTATTGAATGAACGATGAATGAATGAATGAATGAAT 
ACATTTAGATAATTCAGATTACTCTTTCTAGCTCAGCAGTGTAA 
AGCAGGAAGACATGCTGTCAATATGATTTAGGGCAAGTTT 


SEQID 
NO. 189 


KCP 1751 


AACGATGAATGAATGAATGAATGAATACATTTAGATAATTCAGA 


SEQID 



1NSDOCID: <WO 2004041 193 A2_l_> 



WO 2004/041193 



-152- 



PCT/US2003/034681 



06 


TTACTCTTTCTAGCTCAGCAGTGTAAAGCAGGAAGACATGCTGT 
CAATATGATTTAGGGCAAGTTTTCAAATCTCTCTGGACCTCAGT 
TTTACCTCTTGAAAAATAAATATAATAATTTGTCCTTACTTCAT 
GAGACTATTTTGAAGATTAAATGAGATAATGTATACACTACTAC 
TCACTGTCCTTACTTGAATATTCCTAGGTCCTTGGTGCTACATT 
AGGCTACATAGAATGTATTTAAAGTAATAGAGTGGTATTTAATA 
AATATTCATTTTCTTTCCCCAGAACTACCTTAAATTAATTTGTT 
GAAAGGACAGATGGATGGATGGTTGATGGAAGTAGCAGGCTTCC 
AGCAGCAGGGGATGGAGTGAGTGTGTGGATACCGCTGGATCAGC 
AGAAGGTTATACCATTTTAGAGTAACTATCTCGGACTTCGGAGA 
GTTCCTGGGTATGAAG [C/G] TTTGGCTTTAATTAAAGTCTCAG 
CACAGTGTTAAATGCCATTTTATTTTAGGTCATAATTAACACTA 
ATGAGATGAGTGGATTACAAAGAGCACACATTTTGAGAAAGTGA 
AAAACAACATCTGAGCTTGGTGGTTTCCATTTTCGCTTTTCCCC 
CTCCCATGCTCTGTTCAATTAAAAGTTTTGAGAAAATATTACAA 
CCATACTCCTTGTCTTTGTGGTAATGAAGCATATTAATTTGAAT 
GTGATGAATACAATATTCCACTGACTTTTTTATTCCCTTATCTA 
CAAAAGTTTAAAATAATGGAC CAATTAAAC CAGGAGAGAAGAAT 
GCAGGGTTTGCCTGGGGATCCAATTCAGCAACCAGAGAACTGAA 
AGAACAAAATTTTTTGACGGAGTCTGGGCCAGACTTCATCCCTT 
ACCTATAGCTGACAAACAGTAAGTCAAATTGGGCAGATGTGGAC 
CAGCGCAGAACACATACTATATTGAGGATCGAAAGGC 


NO. 190 


KCP_1751 
70 


GTGTAAAGCAGGAAGACATGCTGTCAATATGATTTAGGGCAAGT 
TTTCAAATCTCTCTGGACCTCAGTTTTACCTCTTGAAAAATAAA 
TATAATAATTTGTCCTTACTTCATGAGACTATTTTGAAGATTAA 
ATGAGATAATGTATACACTACTACTCACTGTCCTTACTTGAATA 
TTC CTAGGTCCTTGGTGCTAC ATT AGG CT ACATAGAATGT ATTT 
AAAGTAATAGAGTGGTATTTAATAAATATTCATTTTCTTTCCCC 
AGAACTACCTTAAATTAATTTGTTGAAAGGACAGATGGATGGAT 
GGTTGATGGAAGTAGCAGGCTTCCAGCAGCAGGGGATGGAGTGA 
GTGTGTGGATACCGCTGGATCAGCAGAAGGTTATACCATTTTAG 
AGTAACTATCTCGGACTTCGGAGAGTTCCTGGGTATGAAGGTTT 
GGCTTTAATTAAAGTCTCAGCACAGTGTTAAATGCCATTTTATT. 
TTAGGTCATAATTAAC [A/G] CTAATGAGATGAGTGGATTACAA 
AGAGCACACATTTTGAGAAAGTGAAAAACAACATCTGAGCTTGG 
TGGTTTCCATTTTCGCTTTTCCCCCTCCCATGCTCTGTTCAATT 
AAAAGTTTTGAGAAAATATTACAACCATACTCCTTGTCTTTGTG 
GTAATGAAGCATATTAATTTGAATGTGATGAATACAATATTCCA 
CTGACTTTTTTATTCC CTTATCTACAAAAGTTTAAAATAATGGA 
CCAATTAAACCAGGAGAGAAGAATGCAGGGTTTGCCTGGGGATC 
CAATTCAGCAACCAGAGAACTGAAAGAACAAAATTTTTTGACGG 
AGTCTGGGCCAGACTTCATC CCTTAC CTATAGCTGACAAACAGT 
AAGTCAAATTGGGCAGATGTGGACCAGCGCAGAACACATACTAT 
ATTGAGGATCGAAAGGC C AGGTTC C AGAC CGTCCTCTAAT ATTT 
TCTTAGTGAATATTTGTTGGATGAATGCATGGATGGG 


SEQID 
NO. 191 


KCP_17 52 
52 


CTTACTTCATGAGACTATTTTGAAGATTAAATGAGATAATGTAT 
AC ACTACTACTC ACTGT C CTTACTTGAATATTCCTAGGTCCTTG 
GTGCTACATTAGGCTACATAGAATGTATTTAAAGTAATAGAGTG 
GTATTTAATAAATATTCATTTTCTTTCCCCAGAACTACCTTAAA 

GCAGGCTTCCAGCAGCAGGGGATGGAGTGAGTGTGTGGATACCG 
CTGGATCAGCAGAAGGTTATACCATTTTAGAGTAACTATCTCGG 
ACTTCGGAGAGTTCCTGGGTATGAAGGTTTGGCTTTAATTAAAG 
TCTCAGCACAGTGTTAAATG C CATTTT ATTTTAGGTCAT AATT A 
ACACTAATGAGATGAGTGGATTACAAAGAGCACACATTTTGAGA 
AAGTGAAAAACAACATCTGAGCTTGGTGGTTTCCATTTTC [A/G 
] CTTTTCCCCCTCCCATGCTCTGTTCAATTAAAAGTTTTGAGAA 


SEQID 
NO. 192 



BNSDOCID: <WO 2004041 1 93A2_L> 



WO 2004/041193 



-153- 



PCT/US2003/034681 





AATATTACAACCATACTCCTTGTCTTTGTGGTAATGAAGCATAT 
TAATTTGAATGTGATGAATACAATATTCCACTGACTTTTTTATT 
CCCTTATCTACAAAAGTTTAAAATAATGGACCAATTAAACCAGG 
AGAGAAGAATGCAGGGTTTGCCTGGGGATCCAATTCAGCAACCA 
GAGAACTGAAAGAACAAAATTTTTTGACGGAGTCTGGGCCAGAC 
TTCATCCCTTACCTATAGCTGACAAACAGTAAGTCAAATTGGGC 
AGATGTGGAC CAG CG CAG AAC ACATACTATATTGAGGATCGAAA 
GGCCAGGTTCCAGACCGTCCTCTAATATTTTCTTAGTGAATATT 
TGTTGGATGAATGCATGGATGGGTGGATGAATAGATGGATGGAT 
GGACAGATGGACGGAGAGAGAGATGGATGAATGGATTGTTGG 




KCP 1768 
36 


GCAGGCCTGTGAACCTGACACATGGTCCAGGTGTCTCCCTGAGG 

ACTTCTGGAAGTCTCCCCACCTCTCTGTGGTCCTTTAGGCATTA 

ACACCACCTTGTCACTGTGTCTTCTGAGGCAGTCTGGAAGTTCA 

TACCCCACAATCTCTGTGTACCTTGTCCCCCATTCTGTTCTCTG 

CATTGCAGATGGTTTAAAACACACACACATACACGCGCAAAATG 

TTGTTCCTTTTCTTAAAACCCATTGTGGCCAGGCTAGACAAATC 

CTTAACACGGTCTACAATATTCTGCATGGCATGGCCCCTGGGTG 

CCTCCCAACCTGATCTGTCACACACCACCTCCACCTTTGCCTGT 

TCCCTGGGCCCTAGCACTAACCTTTGGTTCATTCCTAGACACCT 

TTTCAGCACTTAGGCCCCCACAGCCCTCAGAACCTTTACACTTG 

CTGTCTCTTTTGCTTTAA [A/G] TGTTCTTGCCCCACCTACCAC 

CTAGTTAATGCCTTTTCCTCCTTCAGCTCTTAGTTGAAGCATCA 

CTTCCTCAAGGAGGGCAGCCCTGATGAAACTCATTATGCAAACT 

CCAGCCTGGGTTGGGCCTTATCTTTATGCTGTCATGGCCCTGAG 

TATTCTTCCTTTATGGCACCAATCACGGCTTATATGATATACTT 

ATGCTATTATTTGAGTTATGTCTGTCTCCCCCAGTATG CCACTA 

GTATTAGAATCATTGATTTTTAATCATTGTATCCCTAGTGCTTA 

GCACAGAGCCTGGCTCATAATAGATGCTTAATAAATATTTGTTG 

AATAAATGAATGAGTGAATGAATAAATGCCTCATTCAAGAGCTT 

TGGCTCTTTCTGTACTACTACATTACTTCTATTTTTTAGCTCTT 

AATTCTCAAAGCACTTTCTTTGTGCTGGGCTTATGCTGGGAGCT 

TAGACAGTAAAGCTTAGA 


SEQID 
NO. 193 


KCP_18 01 
73 


TTACATCCACAGGTTTGATTATAAATGTGTGTATTGAATTGGAA 
TTTCTGTTGAAATTCTGATCCCTTCTAGACAAAGAAGGTAAAAA 
TTGAAACATGTCAATGGATATCTAAATATCATTACTCACTGGCT 
TTATTTGCAAATGGCTTTC CATTGACAACAGTTACATTTTGTTC 
AAAGCAACAAATGATTGGCGCTGAGAATCCACAGGAACATGGTG 
CAGTCATTAATGAATGTGCTCATTATTCCTCCCTGCCGGGAGGC 
ATCGACTCCCGTTCTCCAGCCTGTTTTAAGCAGACAGACCTACA 
TCTGCACCTGTCAGCTTGGAACCCTAGTAGGGGAGGGGGATGCT 
GATGTGATGGAGAATGAAGAATGGGCCCTGCAGGCTGACATTTT 
GGGAGAGTAGGTTCTGAAATTTATCCCAAAGGACATGGAATCCT 
GGAAGCAGGGTTCAAGATCCTCCCAAAATTGATCTCCCAGGATG 
CTTGGAATGATTGTTC [C/T] GAGGGTTTTGTAAAATGCGAGGG 
GAAAACCAGGAAGCTTCTCTCCAGTTGTCTTGCCTCCTTCCTCT 
CCAGTCTCCATGGAGCTGACTTTGAGAATTAACTCCTGAGGGAC 
AGAGACCCTGGGATGGAGAGCCAGCCCTGCTGGATTCCACAAGG 
TGCTGCTTAAAGCACAACACCTCTTCCCAATGACAGGTTCTGAA 
AGAAGGCCTTGTAGCTAGATGCACAGAGGGTTTTGTTTTGTTTT 

TTTTTTTTTA A PPTTTHZi TIT 1 r ,r Pr« r Pr» rp7\ 7\ A 7\ rnrr^rT^rnr-trT^rM^r^r^m 

J-J-J.J.XAXJ. iftftvui x ± uiuLAi LlblL 1AAAATTGCTCTGGGCT 
GGGTACAGTGG CTCC CACCTGTAATCC C AACACTTTGAGAGCTG 
AGGCAGGAGGATCGCTTGAGCCCAGGCGTTCTAGACCAGCCTGG 
GCAATATAGTGAGATCTCTATGTCTAGAATGTTTTTTAATTAGC 
TGGGCTTGCTGCCTGCACCTGTAATTCCAGCTACTTGGGAGGCT 
AAGGTGGGGGGATCACTCGAGCCCAGGGGGCTGAGGC 


SEQID 
NO. 194 


KCP 18 02 
37 


CCTTCTAGACAAAGAAGGTAAAAATTGAAACATGTCAATGGATA 
TCTAAATATCATTACTCACTGGCTTTATTTGCAAATGGCTTTCC 


SEQID 
NO. 195 



tNSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-154- 



PCT/US2003/034681 





ATTGACAACAGTTACATTTTGTTCAAAGCAACAAATGATTGGCG 
CTGACAATCCACAGGAACATGGTGCAGTCATTAATGAATGTGCT 
CATTATTCCTCCCTGCCGGGAGGCATCGACTCCCGTTCTCCAGC 
CTGTTTTAAGCAGACAGACCTACATCTGCACCTGTCAGCTTGGA 
ACCCTAGTAGGGGAGGGGGATGCTGATGTGATGGAGAATGAAGA 
ATGGGCCCTGCAGGCTGACATTTTGGGAGAGTAGGTTCTGAAAT 
TTATCCCAAAGGACATGGAATCCTGGAAGCAGGGTTCAAGATCC 
TCCCAAAATTGATCTCCCAGGATGCTTGGAATGATTGTTCCGAG 
GGTTTTGTAAAATGCCAGGGGAAAACCAGGAAGCTTCTCTCCAG 
TTGTCTTGCCTCCTTC [C/G] TCTCCAGTCTCCATGGAGCTGAC 
TTTGAGAATTAACTCCTGAGGGACAGAGACCCTGGGATGGAGAG 
CCAGCCCTGCTGGATTCCACAAGGTGCTGCTTAAAGCACAACAC 
CTCTTC C CAATGACAGGTTCTGAAAGAAGGCCTTGTAGCTAGAT 
GCACAGAGGGTTTTGTTTTGTTTTTTTTTTTTTAAC CTTTCAG C 
ATCTGTCTAAAATTGCTCTGGGCTGGGTACAGTGGCTCCCACCT 
GTAATCCCAACACTTTGAGAGCTGAGGCAGGAGGATCGCTTGAG 
CCCAGGCGTTCTAGACCAGCCTGGGCAATATAGTGAGATCTCTA 
TGTCTAGAATGTTTTTTAATTAGCTGGGCTTGCTGCCTGCACCT 
GTAATTCCAGCTACTTGGGAGGCTAAGGTGGGGGGATCACTCGA 
GCGCAGGGGGCTGAGGCTGCAGTGAAC C ATGATTAC AC C ACTGA 
ACTCCAGCCTGGGCAACAGAGTGAGACCCTGTCTCAA 




KCP__1840 
80 


CTGATGGAACTGGGATGTGAGAAGAAGGCAGGTTTTCTGATAAA 
CAATTCCTGTATCTTTCACAAATGCCAAATCACAGACTCAGCTT 
GGGACATATGAGGACAGCACAGACTTTGGAGGCAGGTAGATTTT 
GGGTTGTCACGCAGACACCCACTACTATGAGACCTGGATTTCCT 
TCTGACGTTATTGGGGATAAGAAGTGGCACCTCACCATTTCTAG 
GAAATAGTAGGTAAGTCTTTCTGGTTGCCACTGAGGTGACTCAC 
CTGAGACACAGTTGCTCCTAAAGTTCAAGGTTAGGAGACAATCC 
AGAAGGGGAGCTGTCTGTGAAGTCAGAATTCTTGGAAGAATGTA 
AGTCTTTACACAGTAACAGCAAAGCAGACAGTGGGAACCACTAC 
TCTGCCTTCTTGCATCATTCTTTCCTAGAAATACCAGAAAGCAG 
TGAGGGATTAAGTCTAATTCCTGGCACCTGACCTTATATCTAAC 
AGATGCTCAGTATTAC [C/G] TGTTGATGGGACCTCACTGGGAA 
TGTTTTGTGTGCAGTACAAAAGGGCAATAGATGAAACTTTGGGA . 
CGGGAG C C CAGGAAAATGG CTGAGAGGAGAG CTTATGCCTAGCT 
TATGCATGAGCTTGCAAAAAGGGAGAATACACGGGAGGGAAGAT 
CAGCAACAGCATGAGTTTTATAAGGCAGAGAGTTGTTGGGAAGG 
AAGCAGCAGGGAGAGGGGAAGGAGTAAGTAGAAACCTAGAAGAG 
ATACAGCTAAGATAAGCCAAGAGAACAAAGTATTGACTTACCAG 
AAACATGGAAGTCTTCCTGCTTCTAATTTAGTTCCGCATATCTG 
GATATGTGAATGC CTAAAATCC CATTAAGC CCAGTGGGTTAATT 
ATTACACTTGCTAGGGCCCCAGAGGAGAGGAAACACAGTAAGTC 
AGAAAAACCTCf GGGCAGGTGAATTTCTCAGGTTTTCTTCTGGG 
CAGATGGGATCTGGAATGGTAGCGTGGCATCCTGGTA 


SEQK) 
NO. 196 


KCP_1855 
79 


CCTTTCCAATATTAAAATAATATTAACATTGGTAATAGTGGTAC 
TAAACAACTTAGGGTGTTTTTTTTTTCATTTAATAGTATATTTT 
TAGTATCTTTCCAGGAAAAGATACATGGATGTGCCACATTATTT 
TTAATGGCTCACATGGTACTCCTTTTATGTATGCACTATAATTT 
ATGGAACCAGTTTTCTCACCGATGAGCATGTAAGTTCTTTCAGT 
CTTTTACTGTTATAAACGAATGATGCAA:L UAAIAl LL11 1 AC A 
TATATATTTGTGCGCATATGTAGGTATCCTTACAAGTGGAATTT 
CTGAATAAATGGATATATACAATTTATTTATGAATTTACCTTCC 
TACAAGTGATTCAAGAGAGTGTCTTTGCTCCACAGTGTTGTCAA 
TATAGTGTATTCTCAAAATCTGACACCAATATGTGTGAAGTGCC 
TGCTCTGTTCCCACACTTTACACAGGTTCTCTTATTTG [C/A] G 
TTAAGTTTATTTAAGAAGAGGAAACTGGGCCTCATGGAGATCTA 
GGAACTTGCCCAAGGACAGGTCTCTGTGACTCTAAGAGTGCAAT 


SEQ ED 
NO. 197 



WSOOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-155- 



PCT/US2003/034681 





CTTCCC'rr'rrCCCCATGTCAAGCACCTTTCCCCACCAGGCTCAC 
TGCTGACAATCCAGTGTACGAAGAAGGGAAATTACCCCCACAGA 
GCCCAAAAGTTTAGGACATGCCGACAGCATCACTCTTTTGCCTC 
CTCATTCTCTCTTTCATTTCCAGAACATTTGCTCACTCAGTGCT 
GCCCAGTGATACTTAGCCAGCCTGATTACCCATCTAATAATTTC 
TGATACTAATATAAAACCTTCCCAAAGACAAA.TATAACTGAGAC 
GCACTCCAGCTTAC CATAG CTTTC CTGGTGGTACAGTTTCCAGG 
GACATTTCACTGTGTCAAAGCAGGGACCACATATGTTCCAGACC 

AGCTTGTTGGGTTTTTCACTGGGAAGTGAAGACAAATTGTTGTC 
CCTT 




KCP 1860 
48 


TTGCCACACTTTACACAGGTTCTCTTATTTGCGTTAAGTTTATT 

TAAGAAGAGGAAACTGGGCCTCATGGAGATCTAGGAACTTGCCC 

AAGGACAGGTCTCTGTGACTCTAAGAGTGCAATCTTCCCTTTTC 

CCCATGTCAAGCACCTTTCCCCACCAGGCTCACTGCTGACAATC 

CAGTGTACGAAGAAGGGAAATTAC CCCCACAGAGCC GAAAAGTT 

TAGGACATGCCGACAGCATCACTCTTTTGCCTCCTCATTCTCTC 

TTTCATTTCCAGAACATTTGCTCACTCAGTGCTGCCCAGTGATA 

CTTAGCCAGCCTGATTACCCATCTAATAATTTCTGATACTAATA 

TAAAACCTTCCCAAAGACAAATATAACTGAGACGCACTCCAGCT 

TACCATAGCTTTCCTGGTGGTACAGTTTCCAGGGACATTTCACT 

GTGTCAAAGCAGGGACCACATATGTTCCAGACCAGCTTGTTGGG 

TTTTTCACTGGGAAGT [A/G] AAGACAAATTGTTGTCCCTTTGA 

AAAAGCATCTTTCATCTCTCCATCTATCTGCGATCTAAAGCAAT 

GGGGCTCTTTCTGTATGTCTTTCAAATGGTCTACACTGACACAC 

GTTTTCTCTGAGCTGCCGAGAGAATATGCCATGAGATGTTGCCA 

GTGATGGTTACACTCAGCTAGCAGAAGATTAGGGACTGGTTAAA 

CCTTTGGAGAAATTGCCTTGGGAAAAGAGGAAATAAAAGCAAAT 

ATTACTATGAAACATAGAGATTACCAGGTAGGAGGAGGAGAGAG 

GTGGAGGGAGGGGTAGGAGTGGAAGGAAGGGAGGGAGGCAGAAA 

GAGGAAGGCAGACTGGTGGAAAATAAACCGTGCACTTTAGAACA 

GCAGGAAGGGAGGCTTGGAAGCCTGGTTTTCTGGCTTTGAATGA 

CCGCCTAGCGCTTGCCGGTGCGCCAGGGTGCTGTGAGGATGTGG 

GCAGAGGGCGAGTC CGAAGGGCTC CAGACACTGGGAA 


SEQID 
NO. 198 


KCP_18 66 
79 


GAGAATATGCCATGAGATGTTGCCAGTGATGGTTACACTCAGCT 

AGCAGAAGATTAGGGACTGGTTAAACCTTTGGAGAAATTGCCTT 

GGGAAAAGAGGAAATAAAAGCAAATATTACTATGAAACATAGAG 

ATTACCAGGTAGGAGGAGGAGAGAGGTGGAGGGAGGGGTAGGAG 

TGGAAGGAAGGGAGGGAGGCAGAAAGAGGAAGGCAGACTGGTGG 

AAAATAAACCGTG CACTTTAGAACAGCAGGAAGGGAGGCTTGGA 

AGCCTGGTTTTCTGGGTTTGAATGACCGCCTAGCGCTTGCCGGT ! 

GCGCCAGGGTGCTGTGAGGATGTGGGCAGAGGGCGAGTCCGAAG 

GGCTCCAGACACTGGGAATAGTGGTGGTCGTGTGCTCCTCCCTG 

AAACTTTTGCACTACCTCGGACTGATTGACTTGTCAGACGGTAA 

GCGAACCCTGGAGCTTCCCCGTTTTCTGTGAATGTGTTTTTGTG 

GCTTCGGTTGCTGTGA [C/G] AGTCGTTTCGAAAATGCACGGAA 

ATGAGGGCGGAGACCCGAGAGATTTGAAAAAGCCGGGCTGAAAC 

AGCGTGGTATTGGTCCCCGCCTCCCCAGTCGCGCCCCAGTGCTG 

CGCTGTCCGTCGTGCTGAAA.TGTGGTGCGCCTGGGGAGTGCGGG 

AGCCAGGAAGTTAGGGTCTCCTGCTCCGGCCCTATGAGCATGTG 

AGTCTTGATGGATTATTAGCTATGGGTGAGGCCAGCACAA.CACA 

TCACAATTCTCTCTGAAGCTGTCTGGTAACTACGTATATTGTTG 

ATGGAAGCCAGTGACTTTTAAAAGCCATTATGTTGATTAACTTT 

TTTAAAGAAGTTTAGGAGATTATATGGAGGTAAAAA.CCTTTGTA 

AAATGCTAATCACAGTGTCTGACAATTAGAACACATTTAATAAA 

TGTCAGTTTCTTTGCTCAACCCTTATAAGAACCCTTATTCCAAA 

GCCACCTCCTCAGCTCTGACTTCAGCTCCATTCCTTA 


SEQID 
NO. 199 


KCP 1871 


TGCTGTGACAGTCGTTTCGAAAATGCACGGAAATGAGGGCGGAG 


SEQID 



WO 2004/041193 



-156- 



PCT/US2003/034681 



16 



ACCCGAGAGATTTGAAAAAGCCGGGCTGAAACAGCGTGGTATTG 
GTCCCCGCCTCCCCAGTCGCGCCCCAGTGCTGCGCTGTCCGTCG 
TGCTGAAATGTGGTGCGCCTGGGGAGTGCGGGAGCCAGGAAGTT 
AGGGTCTCCTGCTCCGGCCCTATGAGCATGTGAGTCTTGATGGA 
TTATTAGCTATGGGTGAGGCCAGCACAACACATCACAATTCTCT 
CTGAAGCTGTCTGGTAACTACGTATATTGTTGATGGAAGCCAGT 
GACTTTTAAAAGCCATTATGTTGATTAACTTTTTTAAAGAAGTT 
TAGGAGATTATATGGAGGTAAAAACCTTTGTAAAATG CTAATC A 
CAGTGTCTGACAATTAGAACACATTTAATAAATGTCAGTTTCTT 
TGCTC [A/G] ACCCTTATAAGAACCCTTATTCCAAAGCCACCTC 
CTCAGCTCTGACTTCAGCTCCATTCCTTAGTGAGAATGGGGTTA 
TAAATCCAGGTTAACCCGATTGTTTAGGATTAGAAAGTGATTTG 
GTTTCCAACGTTGAAGGAGTTCAAGAAACAAAGAGTTTTATTTT 
TCCTCCTTATGAGATATTGTTCCAAATAGAACACAGTTTGTCTA 
GATGATTTTTGTCACTTAAAATTAGGCTCCAGGAAAGATTCCAA 
ATTTCATGAGCAATTGGGCTCATAAAACAAGATCAAACTCCAAT 
AGTGTATATCCAAAGTATGTATAATGTGTATTCGGTGTATATTC 
TTCCACCACTGCATGGTGTAGACAGAATTTCTCTTCCAAGGGGC 
ACCACATGACAAAACCGTACATAATAATGAAATGCATTTGTAGA 

CAAAGGACTAGCTAAAATACCAACTGAAAGTGGGAAGACCAGAA 
ACTGAAG 



NO. 200 



KCP__18 72 
58 



AATTGCCTTGGGAAAAGAGGAAATAAAAGCAAATATTACTATGA 

AACATAGAGATTACCAGGTAGGAGGAGGAGAGAGGTGGAGGGAG 

GGGTAGGAGTGGAAGGAAGGGAGGGAGGCAGAAAGAGGAAGGCA 

GACTGGTGGAAAATAAACCGTGCACTTTAGAACAGC^GGAAGGG 

AGGCTTGGAAGCCTGGTTTTCTGGCTTTGAATGACCGCCTAGCG 

CTTGC CGGTGCGC CAGGGTG CTGTGAGGATGTGGGCAGAGGGCG 

AGTCCGAAGGGCTCCAGACACTGGGAATAGTGGTGGTCGTGTGC 

TCCTCCCTGAAACTTTTGCACTACCTCGGACTGATTGACTTGTC 

AGACGGTAAGCGAACCCTGGAGCTTCCCCGTTTTCTGTGAATGT 

GTTTTTGTGGCTTCGGTTGCTGTGACAGTCGTTTCGAAAATGCA 

CGGAAATGAGGGCGGAGACCCGAGAGATTTGAAAAAGCCGGGCT 

GAAACAGCGTGGTATTGGTCCCCGCCTCCCCAGTCGCGCCCCAG. 

TGCTGCGCTGTCCGTCGTGCTGAAATGTGGTGCGCCTGGGGAGT 

GCGGGAGCCAGGAAGTTAGGGTCTCCTGCTCCGGCCCTATGAGC 

ATGTGAGTCTTGATGGATTATTAGCTATGGGTGAGGCCAGCACA 

ACACATCACAATTCTCTCTGAAGCTGTCTGGTAACTACGTATAT 

TGTTGATGGAAGCCAGTGACTTTTAAAAGC CATTATGTTG ATTA 

ACTTTTTTAAAGAAGTTTAGGAGATTATATGGAGGTAAAAACCT 

TTGTAAAATGCTAATCACAGTGTCTGACAATTAGAACACATTTA 

ATAAATGTCAGTTTCTTTGCTCAACCCTTATAAGAACCCTTATT 

CCAAAGCCACCTCCTCAGCTCTGACTTCAGCTCCATTCCTTAGT 

GAGAATGGGGTTATAAATCCAGGTTAACCGGATTGTTTAGGATT 

AGAAAGTGATTTGGTTTCCAACGTTGAAGGAG [G/T] TCAAGAA 

ACAAAGAGTTTTATTTTTCCTCCTTATGAGATATTGTTCCAAAT 

AGAACACAGTTTGTCTAGATGATTTTTGTCACTTAAAATTAGGC 

TCCAGGAAAGATTCCAAATTTCATGAGCAATTGGGCTCATAAAA 

CAAGATCAAACTCCAATAGTGTATATCCAAAGTATGTATAATGT 

GTATTCGGTGTATATTCTTCCACCACTGCATGGTGTAGACAGAA 

TTT CTCTTC CAAGGGGCAC CACATGACAAAAC GGTAC ATAATAA 

TGAAATGCATTTGTAGACAAAGGACTAGCTAAAATACCAACTGA 

AAGTGGGAAGACCAGAAACTGAAGTGTAAGATGAGGTAAGCCCT 

GGAGTAAGAGTCAAGAAATCCACTTTCTATCCATAATCTGTCTC 

GGTTTAATGTTGGTCAAGTCATTTTTTAAAAAATTCTAGGTCTT 

GGTTTCCTTATGATGACTTTAGATCTCTGTTCCTTGGAATTCTA 

GAGTGATCCAAAGGTTTCTTTGAATTCAGTTTTGTGGGTTGAGA 

CGGGCAGCCAGACTGTGAGTCCCTCAGCTCTGCTTCAACCAGAA 



SEQED 
NO. 201 



3NSDOCID: <WO 2004041 193 A2J_> 



WO 2004/041193 



-157- 



PCT/US2003/034681 





CAGCTCCACTTTACTGTTCAGCATGTTAGCCCTGTATGTAAGGA 
TGTTTTTTAGCTTTAGCTAAAATTTAGTGACTCTATGACCCTAA 
GGCCCTGCTTCCCTGAGATTTTGAAAGCTGAAGCACATTCGGAA 
AACTTTTTCTTCCTTAAAAATCACCTGAAATCTGACAATCTGGA 
AGACTAGTTCTGTCTGCTCCAGCCCTTGGTCCCTTAGATGTGCT 
TTTCTGAAGATCCAAACTCAACCTGCCAGTCAATATACCAACTG 
AGCAGAGCCCCTGTTCTCCACCAGATTTCAAGAGAACATGTTCC 
ATTCCTGTTCAGAGCTTCAGAGCAGCTTCCGCTAAGATTGCACA 
TTAATGCAACAGCGTCCTATTTTCTTTGTTTCTTTTTTTTTTTT 
TTTTTTTTTTTTTGATGAGACAGGG 




KCP_1876 
88 


ATTTTTCCTCCTTATGAGATATTGTTCCAAATAGAACACAGTTT 
GTCTAGATGATTTTTGTCACTTAAAATTAGGCTCCAGGAAAGAT 
TCCAAATTTCATGAGCAATTGGGCTCATAAAACT^AGATCAAACT 
CCAATAGTGTATATCCAAAGTATGTATAATGTGTATTCGGTGTA 
TATTCTTCCACCACTGCATGGTGTAGACAGAATTTCTCTTCCAA 
GGGGCACCACATGACAAAACCGTACATAATAATGAAATGCATTT 
GTAGACAAAGGACTAGCTAAAATACCAACTGAAAGTGGGAAGAC 
CAGAAACTGAAGTGTAAGATGAGGTAAGCCCTGGAGTAAGAGTC 
AAGAAATCCACTTTCTATCCATAATCTGTCTCGGTTTAATGTTG 
GTCAAGTCATTTTT [T/A] AAAAAATTCTAGGTCTTGGTTTCCT 
TATGATGACTTTAGATCTCTGTTCCTTGGAATTCTAGAGTGATC 
CAAAGGTTTCTTTGAATTCAGTTTTGTGGGTTGAGACGGGCAGC 
CAGACTGTGAGTCCCTCAGCTCTGCTTC!AAC , C!AnJ2xaPanr , Tnr , A 

CTTTACTGTTCAGCATGTTAGCCCTGTATGTAAGGATGTTTTTT 
AGCTTTAGCTAAAATTTAGTGACTCTATGACCCTAAGGCCCTGC 
TTCCCTGAGATTTTGAAAGCTGAAGCACATTCGGAAAACTTTTT 
CTTCCTTAAAAATCACCTGAAATCTGACAATCTGGAAGACTAGT 
TCTGTCTGCTCCAGCCCTTGGTCCCTTAGATGTGCTTTTCTGAA 
GATCCAAACTCAACCTGCCAGTCAATATACCAACTGAGCAGAGC 
CCCTGTTCTCCACCAGATTTCAAGAGAACATGTTCCATTCCTGT 
TC AGAG CTTCAGAGCAGC 


SEQID 
NO. 202 


KCP 1893 
31 


CTCTAAAATTTCACCCTCTGTTCTGTACACCAAGTACCTCAGCA 
AGTAATCCAGTTCCAGATGGGATCTGCAGTCTGCCATTAAGTCT 
TTACCACACATAGGCTCTTATGCTAGAGCC CTTACCATATGGT C 
CAAAATGCCATTTTTAATGTGTATTTGATATGGAGACTCTGTTC 
ACAATTTGAGTACTAAAGAGAGAATACCACCTCCTAGTAGATAC 
ACCAGGACCAATGTAATGCTGTCATTCTAAGGAGAGCAGTGGAA 
CATCTCCAAAGAACCCATCTGTAGTCTTCCTTCGGCCCTTGATC 
TTATTCCTATTTTATTTTTAAGGTTTTTTTTTTTTTCTTCGAGA 
CTAAATCTCACTCTATCACCCAAGCTGGAGTGCAGTGGCATGAT 
ATCAGTTCATTGCAACCTCTGCCTCCCGGACTCAAGCGATTCTC 
CTCACTCAGCATCCCAAGTATCTGGGACTACAGGCATACACCAC 
TATGC CCAG CTAGTGT [A/G] TGTGTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGTGTGTGTTAGTAGAGACAGGGTTTCACCATGTT 
GCCCAGGGTGGTCTTGAACTCCAGAGCTCAGGCGATCCACCTGC 
CGAGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACAGCG 
CCTGGCCAATCTTTTAGGGATAATTTTAGAACAGTATACAGATA 
TTGAGCCAAGAGTCAAAAGAGCTGGGTTGCAATTCTGGTTGTGC 
CATTTATCAGTTGTGTGAGGTGGGACAAGTCTCTTTTTCTCCCT 
AGCTTTCTCTTTCCTCATTTATAAAATAAAGAAATGAGAATGAT 
AGTTGTATTAATTTCTGAGGACTG C C AGAACAAATTACTACAAA 
CTGGGTGGCTTAAAACAACAAACATTTATTCTCACATAGTTCAG 
GAGGCTAGCAGTTTGAAATCAAGTTCTTGACAAACTCCCCTAGA 
GTCTAAAGTCTCTAGAGAAGGATTCCTCCTTGCCTCT 


SEQID 
NO. 203 


KCP_1927 
42 


GAGCAAGCACTGCAGCCATCCTCCTTTATTTCCCTCAAGGCAAT 
ATCCAAGGATTAAAAAGTCAGAGCCGTCTGCAGATTCCTCCTCT 
CTACCTTGCCCTGCACTTTTTGTGCCCTTCCTCTTCCCCCTCTC 


SEQID 
NO. 204 



3NSDOCID: <WO 2004041 193A2J_> 



WO 2004/041193 



-158- 



PCT7US2003/034681 





CAGCCCCAAACCTCTCTCCTGATCCACGGTACTCCTCCTGGGAT 
GTCCACTGGGGCTGATCCTCCCCCATTCTCCCCCTGAGTTCCCT 
GCTGTTAATCTGTCTCCAGCAAAATTAACCTAGCCTATGTCCCA 
TGCCCTCTGGACTCTGGCTGCTCGTCAATCACTCTTAAAAATCC 
GGTTTCTCCTTAGGCAATCATTTTGTTTTGATTTTATGTGTAAA 

CTGTTGCTCAGGCTAGAGTACAGTGGCATGATTTCTGCTCACTG 
CAACCTCCGCCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCT 
C [T/C] TGAGTAGCTGGGACTACAGGTGCCCACCACCATGCCTG 
GCTAATTTTTGTATTTTTGGTAGAGACAGGGTTTCATCATACTG 
GCCAGGCTGGTCTCAAACTCCTGACCTTGTGATCCACGCACTTC 
GGCCTCCCAAAGTAATCACTGCTGGGATTACAGAAGTGAGCCAC 
CGTGCCTGGCCAAACCTAAGTAAATGTTTTAAAATTATACTACT 
AACATAGCATACAGGCTTTAGACTGTTGGTTGCTTTTAAGTTTG 
CTTACTTTAAAAGCTAGAGAGAAGATGGTTGAGGTGATCTTGTC 
TCCTTCAGTATTCACTCTGAGCCATGCCTCCTGAGGAAGTTTGC 
TTTAGGGGAGGCATTGCTATGTTATACACTCTACGATGCACCAG 
C C CTTGCCTCAGAAGGCAAGGTTTGAACCCCAAC ACTGTCTTTT 
GCAAACTGTTACCTTAGGAAATAGATTTTATCTCCTTAACTCAC 
TTTTTA 




KCP_1931 
93 


GTTCAAGCGATTCTCCTGCCTCAGCCTCTTGAGTAGCTGGGACT 

A.CAGGTGCCCACCACCATGCCTGGCTAATTTTTGTATTTTTGGT 

AGAGACAGGGTTTCATCATACTGGCCAGGCTGGTCTCAAACTCC 

TGACCTTGTGATCCACGCACTTCGGC CTCC CAAAGTAATCACTG 

CTGGGATTACAGAAGTGAGCCACCGTGCCTGGCCAAACCTAAGT 

AAATGTTTTAAAATTATACTACTAACATAGCATACAGGCTTTAG 

ACTGTTGGTTGCTTTTAAGTTTGCTTACTTTAAAAGCTAGAGAG 

AAGATGGTTGAGGTGATCTTGTCTCCTTCAGTATTCACTCTGAG 

CCATGCCTCCTGAGGAAGTTTGCTTTAGGGGAGGCATTGCTATG 

TTATACACTCTACGATGCACCAGCCCTTGCCTCAGAAGGCAAGG 

TTTGAACCCCAACACTGTCTTTTGCAAACTGTTACCTTAtG/A] 

GAAATAGATTTTATCTCCTTAACTCACTTTTTACATTTGCAAAA 

TGGGTAAATTGTGACTACCTCACATGGATGTCATGAGATGAAAT 

GTAAGAATGTGTGTCC CTGGCATATAGTAACCACTTTCGCCAAA 

GACTGAGTTATCCAACTACAGACAGAGAACAGCTGGTGGCCTAA 

TCAAAGGGAGATACAAAATAACAATGCCAAGACTGGAAAAGGAA 

GTTCATCTTAGGATTTCCAAGAGAAAAAGAAATATGACTGTATT 

ATAATAGGTATATTTATTAAG CTCTTAC C ATGTG CC AAG CAAAG 

TTCTTTATATACATGATATACTTCATATACATTATTTCATTTAG 

TCCTCATGGCTACCAGGTGAGCACCATTATTTTCCCATTTTACA 

GATGAGGCACAGAGAAGTTAAGCCACTTACCTAGGAAGGGCAGT 

CCTAGTTAAGAAGCTGGGATTCA?UVTCCAAGAGGCTGGATTCCA 

GACCTCAGG 


SEQID 1 
NO. 205 


KCP 193 9 
56 


TTATAATAGGTATATTTATTAAGCTCTTACCATGTGCCAAGCAA 

AGTTCTTTATATACATGATATACTTCATATACATTATTTCATTT 

AGTCCTCATGGCTACCAGGTGAGCACCATTATTTTCCCATTTTA 

CAGATGAGGCACAGAGAAGTTAAGCCACTTACCTAGGAAGGGCA . 

GTCCTAGTTAAGAAGCTGGGATTCAAATCCAAGAGGCTGGATTC 

CAGACCTCAGGCTCTATTATGAGAAGTACCTAAATAGAGATTGG 

TTTAACCAAAGCCTGAGTCCCAACTAAGGGCAAGACTGTGACAC 

AGAGGTCACTAATCAGAATGAAAGATTGAGCCAGAGTTGAGTTG 

TTGGAATGTATTTTGGTACATTTAGGTTGTTTTAAGTATATCAA 

TCTCCATTCCACTCAATGGTTGAGTTCAGTTTCAAGTTTTCCAA 

ATGCTTTATGGGAAAGTCATATTTTTCTCCCATTGCAGCAGGGA 

TGCCAGCGCAGCCATG [C/T] TTCTCAACCACCAAGTAGAAGCA 

AAG C CAAACTGACCCAAGAAGATGAACAG AGGGAATCCAGGGAG 

TTCCAACTTGGGTTCACAGCTGCAATTCTCAAAGGATGGACTAA 


SEQID 
NO. 206 



3NSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-159- 



PCT/US2003/034681 





GCCATGTCACCCCTCCAGATAACACAGTCATATTAATAGTGACC 
TTTTGGAGGC CTCCCTAAACAGCAGGTGAAGTCC CAAAATC ATT 
AGATTATTCCTGGCCTCAATTGTGGCCCAGAGGGAGAGCCCTAA 
GATTTTTCCATGGGAACAAAGATCTAAATTCTGGGACTATCTGG 
GCCATGTCCACCCTGCACCATTTACTACAAAATGGGCTGATCCT 
ATGGAAGCACACTACCTGTGTTGTGGTCATATAGATCATCACCT 
GGCTTCTCCAGGGCTAACCAGTTAGCATGGAAATGGGACACCCA 
AGAACAAGAGGATAGAAAGAAGGGAAGGGTGGAAAGAAGGAAGG 
AAGAAAGGGTGGGAGGGAGGGAAGAGTGGTAGTTTTG 




KCP_1946 
16 


TCCTGGCCTCAATTGTGGCCCAGAGGGAGAGCCCTAAGATTTTT 

CCATGGGAACAAAGATCTAAATTCTGGGACTATCTGGGCCATGT 

CCACCCTGCACCATTTACTACAAAATGGGCTGATCCTATGGAAG 

CACACTACCTGTGTTGTGGTCATATAGATCATCACCTGGCTTCT 

CCAGGGCTAACCAGTTAGCATGGAAATGGGACACCCAAGAACAA 

GAGGATAGAAAGAAGGGAAGGGTGGAAAGAAGGAAGGAAGAAAG 

GGTGGGAGGGAGGGAAGAGTGGTAGTTTTGGAAGGAAGGAGGGA 

ATCAGAGCTAAAGATAATACATGATATGAGTCAGTGTTCAATGT 

CCCTGAAGATTAGGGGAATCAAGCTTTGCTTCCAGGAGAATTAA 

CACAGGAGAGCCAACAGAGATGTGGAAATTTAGGAAGTCAGAGG 

AGACATTCTTTCA [T/C] TCATTCATTCGTTCATTCATTCACTT 

GCTCATTTTTACATGAATTGACTCTAGAACAGATGCTGGAGATA 

CAAAGATGCATGAGACTTGCCCCCATCCTCAACAGTCATTCACA 

GTCTAATCAGAAAGAGAG C CTTGCATTTGGAATACAATATGGAG 

TAATAATACCTCTGTGTTCAGCCTGCACAAAATACTCTGTATGC 

ATGGTCATATGTCCCTTGAAACAACTTTATGAGGAAGATACTAC 

TATAGTCTCCATTTGACAGATAAGGAAACTGAGGCTTAGGGAGG 

TCAAATAACTTGCCCAAGTAAAACAACTAGTAAGTAGCTGAACC 

ACAAAACAGAGATTCATGCAGAAAGCTGTACAACAGAAGAAACC 

AGGACTAC ATCTG C CTCAAAGGAACCAGAGAAGGCTTCCAAAGA 

AGGCAGCATTTTAAATGGGTTTTGAAGGATGTATAGCA 


SEQED 
NO. 207 


KCP 1965 
48 


CCCAGCCCTCAGGCACATCAGTGCCCTCTCTAGGCTCTCTCTCA 

CCZAACTTTAGAATTGAATTACATCAGTTGTTTCCAGATGGTGAT 

CTGC AGAATTCCTTTAAAGAC C ACCTGTGGGATTTGAGGGAGGA 

AAACTACACTCTCCCAATCTCCCTCTTTAACCCAAGCATCTGAT 

TGCTTTCATCTGTTTTACATACTTAGCTTCTGTGCACAACTTCC 

TTTGATTAAAGAGTTCCTTGCCTTTATAGTAGTGGATGATATCT 

AAGGATGATGTAAAATACTGGGTGTTAGCTAAGGTTTTACCAAA 

CTTAAAGCCTTTATGCTTCATAATTCCACTTTATTGATGTAGGA 

AGACAAATGATAGACTTACTTTCAAGGTGGATAGAAGGGATGCG 

ACCTAGCCAAGGCTACAGCATTTCTCT [A/GJ TGGCACCACTGC , 

CATGACAACCATCAGTTTGAATGCCTTATGGGTGCATCCTATGG 

GTTATGCACTGGCCCCAAGC CATAACC CCTAGGACTCTAGAGC C 

AGCAGCAAACACAAAACACTGAATTAATAATGAGTGAGATCTCT 

GTTCCC ATAGCTGC CACAGG CTAAATAAGTTGAGGGGGTATTGT 

AAAACCCAAGATGAGATCACTGAGCCTCTGGTATCAAAAAGGTG 

TATTTCACAGAATGTTTAGTTGGACGAGAGCTTGAAGAGCATGG 

AAACGATCTGGTATCATTCTGGT CAAAGAC CAGAATTTAGACC C 

CAGTTCTGCCATTTGCTGACTAATGACTTTGGGCAAAATACTTA 

ACTTTCCTGAGAGTTAGTTTCCTCATCTATAAAGTGGGGTAATA 

TAACCCACCTTGCAGGATACTGGTAGGATTA 


SEQID 
NO. 208 


KCP 1976 
78 


AGCAGGGTCCCCTGAGCGTTCCTCCTCAGCTCCCAACACTTCCC 
TCCAGGCACCAGTATGCAGGGCAAGGTCCTGGAGGGGGCGCCGA 
AACACCACTCGAGATCCTCACTCTCAGGAATTCAATATAGAAAA 
CACATTAAGACCTGTTTACATGGAACTGCTGTTTATAATTATTG 
TTCCCTATGGGATATTCCCCACTGCTTCCTCCAATCCTCTTTTA 
AACTGCTCAACTAATAGAGTTTTCCTGGCTTCCCCAGGGAGACA 
TTCACAGATGCTAATAGAGACATAATTCAAAAATTGCTTGATAT 


SEQED 
NO. 209 



WSDOCID: <WO 2004041 193 A2_l_> 



WO 2004/041193 



-160- 



PCT/US2003/034681 





ACATGCCCTCAATTTTCCCCAAGAACCACCTAAGTAAAGAGCCC 
CAGACATGCAACACATTCATTGGCCAGATGCAATTTAACATGCG 
TGGGATTAAATATACAGGCTACTACAGCCAGGTTGTCATCAAGC 
AGCAGCAGGCATGGCATTTTATCCTAAGGTACCACCA [T/C] GG 
CCAAATGCAACAGGAAAGAAGCAGGCTGCTGGGTGGGACCCCTG 
GAAGATCCCCTCCTCTGTAATTTCCACTGCAAGCTTTTCCCAGG 
CCTTTTCAGGCAAAGCGGGGAGTTTTGAAAATAAATCCCCCAGG 
CTTGGAGAAGCAAAGAATCAATGCTAAGCAGCTCCGGAAATAAT 
AGCTTCCATCTCTCTGATATATAAAGAGGATAAGGAAGGCAGAA 
AGAAGGGGCATGATATTATGAGATTGCAACAATACATTGCAACA 
TTACATTAAAGAATTACAGAAAGCAAGATCTAGCTTCAGATGCC 
AGTTCATGCACTTACTCCCTGTGTGACCCTGGGAATCACTTAAG 
CTGTCTGAGACTTAGCTTGTCTAATGACAAACTGGGGATACTAA 
TATCACCTCCCAGGATTGTTGGGAAGGTAAATGGAGATTGACAA 
ATGTGAACACACTTAGTATGTCTTT 




KCP_1977 
75 


TGTTTACATGGAACTGCTGTTTATAATTATTGTTCCCTATGGGA 
TATTCCCCACTGCTTCCTCCAATCCTCTTTTAAACTGCTCAACT 
AATAGAGTTTTCCTGGCTTCCCCAGGGAGACATTCACAGATGCT 
AATAGAGACATAATTCAAAAATTGCTTGATATACATGCCCTCAA 
TTTTCCCCAAGAACCACCTAAGTAAAGAGCCCCAGACATGCAAC 
ACATTCATTGGCCAGATGCAATTTAACATGCGTGGGATTAAATA 
TACAGGCTACTACAGCCAGGTTGTCATCAAGCAGCAGCAGGCAT 
GGC ATTTTATCCTAAGGTACCAC CACGGCCAAATGCAACAGGAA 
AGAAGCAGGCTGCTGGGTGGGACCCCTGGAAGATCCCCTCCTCT 
GTAATTTCCACTGCAAGCTTTTCCCAGGCCTTTT [C/T] AGGCA 
AAGCGGGGAGTTTTGAAAATAAATCCCCCAGGCTTGGAGAAGCA 
AAGAATCAATGCTAAGCAGCTCCGGAAATAATAGCTTCCATCTC 
TCTGATATATAAAGAGGATAAGGAAGGCAGAAAGAAGGGGCATG 
ATATTATGAGATTGCAACAATACATTGCAACATTACATTAAAGA 
ATTACAGAAAGCAAGATCTAGCTTCAGATGCCAGTTCATGCACT 
TACTCCCTGTGTGACCCTGGGAATCACTTAAGCTGTCTGAGACT 
TAGCTTGTCTAATGACAAACTGGGGATACTAATATCACCTCCCA 
GGATTGTTGGGAAGGTAAATGGAGATTGACAAATGTGAACACAC 
TTAGTATGTCTTTACATAGTAGGTATTCAATAAACTCTTCTATA 
TATCTTCTCTTTCTGAAAATCTGAATATGGGGAGCATGGATATG 


SEQID 
NO. 210 


KCP 1989 
33 


GTCCCCACCACTCCTTTTTATTGCAGAGGGAATTGACATTCAGG 
GAATGGAAATGCCCAGCCCAGAATTGGGGATGTGGTCTGGGAAC 
CCAGGTCTCCCATCCCACTCCCTCGCCCTCTCACCCCCTCCCGC 
TGGTCAGTGTTCTTTGTCCTCTGCTGGCATCCCTGGGGACGGGC 
CAGCCCCCATCCCCCCGACACACACACATTGTCCCTTCAAGATG 
GAGCCAGGCTGACACCACGTAGAATGACCTGGAAGCC CC CACTC 
AGTCTAC CAGTC CTCC CTCCTCACACAGGAATAG ATGGGAGGGA 
AATGAAATAAGCTGCCATCTGCTGTGCATCCTCTGTGTGCCATG 
CTCTGGGTACCCATCTAATCCTCGTGAAGACCCTGAGAAGTGAG 
TGTTCTTCACAGACTAGGCAACACCAGAAGGCAG [G/A] TGAAG 
AACGTACAGAAGCTACAGAGTGCACAGGTGACAGGTATGAGAGC 
CAAGCCATTCAAACTCCCTGGGTATAGGACCCAGCTCTTCCCAC 
GTCTCTGCCTTTACCGAATCAAACACCTGAGCACGGAAGACCCT 
GCATCAACATGAACTGCTTTGAATTGACATGAACAAGCTTCAAT 
uuwiL i a jl aaa 1 vjL. 1 Li AAA 1 1 X XT C AATTATAG AAAG T ATTTG A 
AAGATCCCATAAATTCCCCTGTCATATCACGTGAGCTGCATTTA 
CTGCAGCAGACACTTTTTATCTCGGGCTTGGAGGAAGGATTAGC 
AAGAAGAAAGTGGAGGGGGTCTGAGGAAGGGCTGGCAGCCTAGA 
GGAGGACAGCAGCAAGAAGCAGGCTGGAGGCAGTTCTGTGCTGC 
CGGCCTTCATGGGTGTGGCCTTTGGACAGCACCTTAGCAGGAAT 
GTGGTGG AGAGCAGC C C CATTCACTCC AGAGG AGAGC 


SEQID 
NO. 211 . 


KCP 1993 


AAGACCCTGAGAAGTGAGTGTTCTTCACAGACTAGGCAACACCA 


SEQID 



JNSDOCID: <WO 2004041 193 A2J_> 



WO 2004/041193 



-161- 



PCT/US2003/034681 



65 


GAAGGCAGGTGAAGAACGTACAGAAGCTACAGAGTGCACAGGTG 
ACAGGTATGAGAGCCAAGC CATTCAAACTCC CTGGGTATAGGAC 
CCAGCTCTTCCCACGTCTCTGCCTTTACCGAATCAAACACCTGA 
GCACGGAAGACCCTCCATCAACATGAACTGCTTTGAATTGACAT 
G AACAAGCTTCAATCAAACTATAAATG CTG AAATTTTTCAATTA 
TAG AAAGTATTTGAAAGATC CCATAAATTCC CCTGTCATATCAC 
GTGAGCTGCATTTACTGCAGCAGACACTTTTTATCTCGGGCTTG 
GAGGAAGGATTAGCAAGAAGAAAGTGGAGGGGGTCTGAGGAAGG 
GCTGGCAGCCTAGAGGAGGACAGCAGCAAGAAGCAGGCTGGAGG 
CAGTTCTGTGCTGCCGGCCTTCATGGGTGTGGCCTTTGGACAGC 
[A/ G] CCTTAGCAGGAATGTGGTGGAGAGCAGCCCCATTCACTC 
CAGAGGAGAGCCTCAAACTCTTCAGGCAGATCTAGCCTAGGTAG 
AATCTTGGCCTGGCCCCTCCGGGATGACAGGTGCCATTGCCCAA 
GAATGGGGAAAAGGCTGAAGTGCTCCAGCCAAAGACCCCAATTT 
ATCTTCAGGACAATTTTCACTGGAAACCTTGCCTCACCACTGCC 
CACTTTTTCAGAAGTAATTAGAATGCTAATCTATAAGAAAGATG 
ACTATTAAAAATAAATTAATAATAGATAATACATTTTGGCTTAC 
AATTTTGAATAATATAGCCATCCCATCTTAAAGTAAAAATTCAT 
ATATTTTTAATAAGCCTGAGACATGTTTTCCAATGAACCACAGA 
TGGTTCATTTTTATTATCCTATAAAGAGACATTATGGGCAAGTG 
TTTTTTAAAATGGTAAAACAGAACCTTAGAGCAGCTCTCTTTT 


NO. 212 


KCP_20 02 
41 


GGCAAGTGTTTTTTAAAATGGTAAAACAGAACCTTAGAGCAGCT 
CTCTTTTGAAGATCTCTAAGCACTTTCTAAGCATCAGGACCCCC 
TTCTGTCATCACAGAGACTGAAATGAGGAGATGGTCTCTGTCAC 
CCCCTCACTCACCAGTGAGCCCCAGACCTTCATCCCTGATCAGA 
TGGAAGCAGTGTGGCATGATTACAGTTCATATTTCAACTCTGCC 
ACTCAATGACTAATAGCCAAG CACTAATAATGCAGAAAATGTAA 
ATTTAAAAAATAATCTTC CTGAGATTGGTTATGAAATGCACTCA 
ACACAGCACCATCCACAGAGAGGTTCTTTTTAATTGCTCTTTTC 
TTTCCTCTCGACACCCAGAATCACAAAGCATGCCTGAAAGCGTC 
ACACATATATGTCTGTGACCATAACATGGCATTGCACATGCAAA 
GGAAATAA [A/G] TAGGTGTTAC CCATGTGACAAAGGTCCATGA 
GCTCTGTCCGCAAAAAGCTGTTGAGTTTAAAGAACAAATAATTC 
TGAAAAATCTTCCAGGAGATGAAATTTGTAGAACTCAAGGGCAG 
TAAACTAGCTGCTTTCCAAGGACTTGTCATAGCTTTATTGACTT 
ACAATAGCCA7UVGATAAGTCAGTATTAATCAAACCCATTCTCTA 
GAAAAACCTCATCATCACTGGGGCCAGGGCAGAGAAGTGTGACA 
CAGCTCTCTCCAGCTTCCCCACTTCACAGCATGGTTCCACCATC 
CACCCAATTGCTAAAGCCTGGATAGTCTTCCTTGTCACCTCCCG 
ATCCCCTTCTCTAACACCCATCCCCCGGCCACCCAACATCAGCA 
AGTCTGGTGGTTTCTCTCTGTCACAGAGATTCAAGATCTTCCC 


SEQID 
NO. 213 


KCP 2019 
85 


TCGTAGTGTTCATAGACTCTCCTTCCTTCCTAACTTAAAAGAGG 

CTCCTTCTGGTTTTTCCTTCATACACTTCCCTCACTCTTTTCCT 

TC ACTGC ACTAAAGATG ATTTCTAATTG CATAGTCATTGATGC C 

AGTATTTGTTTATTGTGTCATTCCTGCTGAACAGAGGATGGGCC 

TGACTTATTTGGGACCATGTTGCTGATGCCTGGACCTAAGCCTG 

GCACAGAGTAGGAGCTCAACAAATTTGTTAAATGAGTGGCTGAA 

TGGCCATACTCTCAAAGGACCCACAGTCTAGGAGAGACAGAAGA 

ATCTTTGTCTTTTTGTCTTGCAGTGGGATGGAAGCTGCAGGGAG 

GGGTCTTGTCACATTGATACTGTCTGGGGAAGACAGAAAAACTT 

CAGTTTC AGAGGAGGTAGC CCTTGAAAC [G/A] AGATTTGAGAG 

AGGGCAGCACATTGTACAACTCCATGGGCACCATGCACATTGTA 

GTCCAGATAAACAGAG CC CCTTGGAGATATGTGAGGCATGGGAT 

AGACTCAGAGAAACCCAGGAAATAACCCCTTCAGGCATCTGACA 

TGCAAAGATGTGGAAGTGTCAACCAGGAAGTCATGTTGGGGGAA | 

CAGCAAGTATTTACAGAAAGTGACTGTGTGTGTCTGTGTAGGAG 

GGTGACTTTGTATAGGAGAGATAAAACCTGTGAGCTAATCAAGG 


SEQID 
NO. 214 



WO 2004/041193 



-162- 



PCT/US2003/034681 





AGAAGATCATAAAAGACCTTCATAAAGAGCATGGCCTTTTTCCT 
GCAAGCAGTGAGGAGCCATTGAAGGCTTTAGCATAAGGACAGTC 
AGATGTACTTCCCTAGAATGCACATTTCCTTCTGCTCCAGAACT 
TCTGCACAGGAGGCTCCTAAAAGCTCTCCCCATCCTCCCTGTAC 
ACGTAGAATCTGCCTCTGTCTCTCTTTCTCTCT 




KCP 2020 
67 


CACTTCCCTCACTCTTTTCCTTCACTGCACTAAAGATGATTTCT 

AATTGCATAGTCATTGATGCCAGTATTTGTTTATTGTGTCATTC 

CTGCTGAACAGAGGATGGGCCTGACTTATTTGGGACCATGTTGC 

TGATGCCTGGACCTAAGCCTGGCACAGAGTAGGAGCTCAACAAA 

TTTGTTAAATGAGTGGCTGAATGGCCATACTCTCAAAGGACCCA 

CAGTCTAGGAGAGACAGAAGAATCTTTGTCTTTTTGTCTTGCAG 

TGGGATGGAAGCTGCAGGGAGGGGTCTTGTCACATTGATACTGT 

CTGGGGAAGACAGAAAAACTTCAGTTTCAGAGGAGGTAGCCCTT 

GAAACGAGATTTGAGAGAGGGCAGCACATTGTACAACTCCATGG 

GCACCATGCACATTGTAGTCCAGATAAACAGAGCCCCTTGGAG [ 

A/G] TATGTGAGGCATGGGATAGACTCAGAGAAACCCAGGAAAT 

AACCCCTTCAGGCATCTGAC^TGCAAAGATGTGGAAGTGTCAAC. 

CAGGAAGTCATGTTGGGGGAACAGCAAGTATTTACAGAAAGTGA 

CTGTGTGTGTCTGTGTAGGAGGGTGACTTTGTATAGGAGAGATA 

AAACCTGTGAGCTAATCAAGGAGAAGATCATAAAAGACCTTCAT 

AAAGAGCATGGCCTTTTTCCTGCAAGCAGTGAGGAGCCATTGAA 

GGCTTTAGCATAAGGACAGTCAGATGTACTTCCCTAGAATGCAC 

ATTTCCTTCTGCTCCAGAACTTCTGCACAGGAGGCTCCTAAAAG 

CTCTCCCCATCCTCCCTGTACACGTAGAATCTGCCTCTGTCTCT 

CTTTCTCTCTCCTCCTCCTCCTCCATCTCCTCCTCCTCCTCCTC 


SEQID 
NO. 215 


KCP 2 02 
795 


GCTCCAGAACTTCTGCACAGGAGGCTCCTAAAAGCTCTCCCCAT 

CCTCCCTGTACACGTAGAATCTGCCTCTGTCTCTCTTTCTCTCT 

CCTCCTCCTCCTCCATCTCCTCCTCCTCCTCCTCTCCCTCTCTC 

TCGCTGTCTCACACACACATACACACACACTCCTTCCTTCCTAT 

CTAGTCAGATTCCACTCCTTGGGATTTCAGGCCCACCGTCACTC 

CTCAGGGAAGCCTGCCCTGAATGCCTGCACTACACCAGGGCCCC 

TTTCCCCTGCCCCCATCCCAGAGCACCAAATAGCTTTCCCTTGC 

AGCACTTCTOVCAGCTGTCATTTTATGTTTGTGTCTGTGATTCT 

TAGGTTAAGTCCCTCATGCACCAAATCATAAGATCTGGGAACAA 

GGACCACACCTGTCCTG [C/T] TCATCACTGTAATCATCACACT 

GCCTGCCAAAGTGCCTTGCACATATTAGATACTTAGTAGTTATG 

TGTTCCATGAATGACTCTTTAAGAGATCTTCTAGCTGTTCTTGC 

AAAGAACCCATTGGTAAGGTTGAACCTACAGGCTGATACTTTGC 

ACTAGTCTCAGGAAGAGATGGTGAGTACATGAAATTGAGTCCCC 

CAGAGGTTAATGCCCAGTGCCCCAGCTAGGAAACGTCCAAGGAG 

GCAATTTGAACCCCATCTGTCTGGCTGCAGAGCCTAGCCCTCTA 

ATGCATTCAGGGGTCCTAGCTCCTCGAGGATGCCACTGTGCCGT 

GAACTTCTTTCTGACCCTCATGGCTCCCAGCACAGCATCCACAC 

TCAGAAGTGCAAGATGAATGTTTGCAGATAATGAACATAAAGCT 

CTCAGGAACCCTCATCTCCTGAGAATCTGCTTTGGCCCCCACAG ' 

CAGGTCTGGGTGTGGACCTTCCCCA 


SEQID 
NO. 216 


K;CP_2 042 
42 


GTGGCAAAGTTGGGATCTTAACCCAGTTCTATGTGGCTATAAAG 
TTCATGGAATAGAATGCTGCAGTTAAGAACATGGGCTTTGGCAT 
CAAGCAGACCTGTATTTGAGCCCCACCTCTGCTGTTTATTAACT 
^ J- v3vjcv^u i vjlsLiCALiAi. CxACCTTACATCCTTAAGTCTCTAGTTCT 
TTGTCTTTAAAAGGGTGGCAGAATGTACCTCACTGGTTTTAGGA 
AGGTCACATGAGATAGTGCACATGAAGCCCTAGGCATGGGAAAA 
TTCTTCTAAAATGTCAGCTGCCATTCTGATCACTGCAAGACCCC 
CACCCCC^^TACTCCCAATTGTACCACCCCACCCCACTCACCAG 
TGTCTCAGAAATGCCTCCTCCAGAAGGAAGGCATCCTGTCTAAC 
CCACTGCTTCTAGCCAAGCTGTCTTTCTTCAGAAGGTAGAAAAA 
GATTGTTAGTCATTGTTTAATCTTTATTGAGTATATACCGCCAC 


SEQID 
NO. 217 



MSDOCID: <WO 2004041 193 A2J_> 



WO 2004/041193 



-163- 



PCT/US2003/034681 





ACCAATTGCACTGCCA [C/T] TCATTATCTCATTTAAATCTGAC 
AAGAGCCTTGTAAAGTAGGGATTATTCCCACCATTTCCCAGATG 
TTGAAACTGAAATTGATAAACACGACATGTTGCCATGGCTACAT 
GAAGATCTCCAAGCCGGAGGATCTCCACCCTCACCTGCCTAGCT 
i tLLAbALL 1 CTCTGCAGAAAAGGGACTGACCCCCAAGACAGCC 
CTGGCCTCTGGGCTCCACCCCTTCCACATCCATCCCAGGGCCGC 
TGAGGACTGAAGAGTTCTCCACGTTTGCCCTTTAAAGTGACTTA 
AAAATAATCTTTATGAAT1TCTTCATATACAAAATTTGTACTTA 
CTCATTGCAGCAAATTTAGAAAATACACATAAGCAAAAAAGAAC 
GTAACAGCCATCCATAACCCTAACTCTCAGAGATCACCACTATT 
AAAATGTTTATTATCTAAGAGAGAGATGATATAGACAAAGATGA 
GACAGATTGACACAGAGAAGATGGGTACATGATAGAT 




KCP 20 62 
67 


TGCTCAACTGTAATCAAACATTATTTTTAAAAAATCATTCCAGC 
CTGGGAAACAGTGAGAAACCCATCTCTACAAAATAAAAAATAAA 
AATTAGCTTGGCATTGTGGCATCTGCCCGTGGTCCCTGCTACTC 
AGGAGGCTGAGATGAAAGGATCACTTGAGCCTGGGAGGTTGGGG 
CTGTGGTAAGCCGTGATTGCCCCATTGCACTCCATTCTAGGCAA 
CAGAGTGAGACCCTGTCTCAAAAAAAATATTATTCATTTAATAT 
CTGTTGCCACCACAGGACTGATCCCTCTGTGAGGGCAGAGATTG 
TTCATGCATGGAATTGTGATTTATAAGCACTGGCTCTGGAGCCA 
GGTTGCCTGAGCACGGAGCCAGCTGTGCCCTGCGGGACACCTGT 
GGCACACTTCACTCCTGGGACACCTGGGACACGCACACAATAGA 
AATGTTCACATTTTACTAGGCAATGCCAGTCACATAGTCCTACC 
TAATTTCAAAAGGGTA [A/GJ AAGGTACACCCAACACGCATCAG 
• GAAGGAGG AGGAC CAGAAATTGTTGGTGACAAGCACAAATGAC C 
ACCCCAATATAATATTTTGTTTGGAAGGCATTTTATTCCACAAA 
AACAACATTACAATAAACACAACAACAAAAGACTGGTTGCAGTA 
GT^CCAACTTTCCAGACCTATCTGCACAGCACAACCATTATCCC 
ACTCAAAATGTCATGTTTTTACCCAAAACATTAAAATTTTAAAA 
GCAATTCAAACCCATAGCTTAAAAAATGTTCCAACCAGTAATAA 
AAGGAAAAGTGTGCCTCCTCCTCCCAACTTCCCTACCCCACAAT 
CGCAAGATATTATCCTTATAGGCGAAAAGGGTTTCAGGATTTGA 
GATGCAGGCTGGGAGGTCTGAGAAGACTTCCTATAGAAGACATG 
ACTTCAAACTCTTTCTTGTATGTGAGATTTAATTTTCAAAGACT 
CCTCTGATCCAACTTAAGCTTTATGGTAAATCACCTT 


SEQID 
N0.21S 


KCP_2 076 
61 


ATATGCCAGCGCTCTATCTGCAGGGGTTCTTTTGATAGCAGCAG 
ACTGAGAGATGATGTTACTGTCCCCTTTTTCCTGTTGTTGGCAA 
CTGAGACTCAGAGGATGGAAGTGACTTGCTC AGGTC CACCACC T 
CTTCAGCTGTGGAGCTGCGACAGGAGCCTTTGTTTGACTTCAAA 
GCTCACCATCACTCCTCTCTCACTGATGCTCAAGTGGGCTATCA 
CCTCGCCTTTCCTGAGCCTTCCTTCGCTATCCTAAAACAGCGCC 
TCCCGAAATC ACC ACTAAAGAACTTATT CATGTAAC CAAACACC 
AGCGGTTCCCCTAAAAACCTATGGAAATAAAAATTAAAAATAAA 
AACAGTGCCTCCCATGACCCATGTCTCTCCAGTCCCATAACTCT 
GCTCTATTTCCATTCACAGCTCCATCCCCACCTTTATGTCTTTT 
GTTCACTGCTTTATCCCCAGTGCCTAGAAGAGTGCTTGGCACCT 
AGTAGACACTCAGTAA [C/G] TATTTGTCGAATGAGTTAATAAG 
GTTGTGAAAAGAACGTTAGATTACTGGAAGGATTCATCTGAGTT 
TAATTCTGCTATGCTGGGAATCCAGTGTGCGGCCTTGGATGAAG 
CCAGTTCCCTCCCTGGGCCCCAGTAGCCACATCTGTa r a ttt a n 

AGGGCAGGAGAAAAGCCACACGCTCTGTGACTTATACAACTTGT 
TGCCCAGAGTGGAGGCTGCTTTGATGCTCAGAAAAAAGAAACAA 
ACATGGAAATGCTAAATGGGTGGCAGAGAGCTTGAGGGAGGAAG 
GAGATGGGGAGGGTACTCTTGAAACTGTTTGGTGTCTTCCCTCC 
TGCCCCCTCAGTACCAATTGTCAAGTACAGAAAGTGAAGGAGAC 
TTGTATTAGTGGAATTTGGTCCCTGACTTGTTATAGAGACACAA 
TTACAAAGACACAAGAGTGGGCCCAGCAGAGACCCTTAGGGTGG 


SEQID 
NO. 219 



tNSDOCID: <WO 2004041 1 93A2_l_> 



WO 2004/041193 



-164- 



PCT7US2003/034681 





TCCCTTGAGGTTCCAAAGCATCTGCCCATCAAGCAGA 




KCP_2079 
65 


CACCAGCGGTTCCCCTAAAAACCTATGGAAATAAAAATTAAAAA 

TAAAAACAGTGCCTCCCATGACCCATGTCTCTCCAGTCCCATAA 

CTCTGCTCTATTTCCATTCACAGCTCCATCCCCACCTTTATGTC 

TTTTGTTCACTGCTTTATCCCCAGTGCCTAGAAGAGTGCTTGGC 

ACCTAGTAGACACTCAGTAAGTATTTGTCGAATGAGTTAATAAG 

GTTGTGAAAAGAACGTTAGATTACTGGAAGGATTCATCTGAGTT 

TAATTCTGCTATGCTGGGAATCCAGTGTGCGGCCTTGGATGAAG 

CCAGTTCCCTCCCTGGGCCCCAGTAGCCACATCTGTACATTTAG 

AGGGCAGGAGAAAAGCCACACGCTCTGTGACTTATACAACTTGT 

TGCCCAGAGTGGAGGCTGCTTTGATGCTCAGAAAAAAGAAACAA 

ACATGGAAATGCTAAATGGGTGGCAGAGAGCTTGAGGGAGGAAG 

GAGATGGGGAGGGTAC [C/T] CTTGAAACTGTTTGGTGTCTTCC 

CTC CTGCCCC CTCAGTACCAATTGTCAAGTACAGAAAGTGAAGG 

AGACTTGTATTAGTGGAATTTGGTCCCTGACTTGTTATAGAGAC 

ACAATTACAAAGACACAAGAGTGGGCCCAGCAGAGACCCTTAGG 

GTGGTCCCTTGAGGTTCCAAAGCATCTGCCCATCAAGCAGATGA 

TGTGATTAGTCTCTGTGACCCCAAGGATGCCTCCTGAAATTGCT 

GATTCAATTTCTC CTAATAAAATAGGAACAATAATTAG C TAATA 

AGAAATCAACAATTAAAGCTATGAGAGAATTAAGTGAGATCATG 

TAAGCAAAGTACATGTCACAGTGCTCTGCAAATAGGCAGTGCTC 

AGAAGTGTCACCTTTTCTCTTTCTTCTCTGAGCCTCCGTCTTCT 

CTTCGGTAAAATGAGAATAATATTATGCATACCTCACAGGGGTT 

AAGCAATGTGAAAGTACTCTGTAAAGTATAAGGCTGA 


SEQID 
NO. 220 


KCP_2115 
25 


GAGATGATCAACAGTCTTTCATCCAGAGGGTTGTGTTTGCTGGT 

GGCCATTACCTTTAACATAAAACGATCATATTTACTTTATCCTA 

TTCATGTCCAACCTCAACTGACAATTGAGTTGTGTCTCTGACAA 

TAAATAGCAGAAAAAGGAAATCTTCCTATACTGAAGAGAAACAC 

AATTAATTAACTAGATCCATCAGGAAAGGTACAATCATGATTGA 

GACAGTGTTTAACAGATGTGACTATTGGATTCTGTTGTTGAGAA 

TGACCCTTAAAATCACAGTCAAAATATACGACAAGATGGAAATA 

ACATTTTTGAGCACCTACTATGCATGTAGAGCATCTTACATACC 

TTATCTCACTTAGATTTACAGCTGCAAGGTGGGTATGATTCTAG 

CTTGAATTAGTCTAATAACCATATACCTCCTAGGGGCAGTGAGA 

TGATTAGATCAATTCTAAAACTATTACCATGCTCTCTGAGCTCA 

CCAAGACAGGCAGTTA [A/G] TACAAGGATACATTAATACCGAA 

TCCAGCAAAAGCTCACATGGCCAGCTTCCATTATGTTCCTATTT 

GTGATTATTCTGTATCAAGCACAGAAATGTATGTTCACACGAAC 

AACAAAGAAGGGGTTTATTAGTGTGGATTACAGGGCCTAAGCCT 

ACCCTCTGAAACTGGTTTTGGAGTCTTTAGCACGCTTGTTTGGG 

ACAGTTAAACATGTGCCAGCTATTCTAAAACAGTAGCAGTAA.TG 

TGATAGAGCTGGGTCATACCGTGCTTCCCAAAGTATGATCACTT 

CATTTCAACAACTTCACACTAACAGCCTGAACTGGGCTGTGAAG 

GGAATATTTAGACCAAGGAAACTGGAAAACTGTATCAATCAGGC 

TTTTCCACCCTCCCCAAGAGCCAGTTGTCAGATATCTACCAGCC 

TACCAACGCTAGCTCTCTAATCAGAAACCATCACTTAGCAAGTT 

CCCAAATTATCTGCAGAGCAATGAACTCCTCTTCTTC 


SEQID 
NO. 221 


KCP_2118 
50 


CTATGCATGTAGAGCATCTTACATACCTTATCTCACTTAGATTT 
ACAGCTGCAAGGTGGGTATGATTCTAGCTTGAATTAGTCTAATA 
ALLAiAiAtL I t-LTAGGGGCAGTGAGATGATTAGATCAATTCTA 
AAACTATTACCATGCTCTCTGAGCTCACCAAGACAGGCAGTTAA 
TACAAGGATACATTAATACCGAATCCAGCAAAAGCTCACATGGC 
CAGCTTCCATTATGTTCCTATTTGTGATTATTCTGTATCAAGCA 
CAGAAATGTATGTTCACACGAACAACAAAGAAGGGGTTTATTAG 
TGTGGATTACAGGGCCTAAGCCTACCCTCTGAAACTGGTTTTGG 
AGTCTTTAGCACGCTTGTTTGGGACAGTTAAACATGTGCCAGCT 
ATTCTAAAACAGTAGCAGTAATGTGATAGAGCTGGGTCATACCG 


SEQID 
NO. 222 



JNSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-165- 



PCT/US2003/034681 





TGCTTCCCAAAGTATGATCACTTCATTTCAACAACTTCACACTA 
AC AGCCTGAACTGGG C [C/T] GTGAAGGGAATATTTAGACCAAG 
GAAACTGGAAAACTGTATCAATCAGGCTTTTCCACCCTCCCCAA 
GAGCCAGTTGTCAGATATCTACCAGCCTACCAACGCTAGCTCTC 
TAATCAGAAACCATCACTTAGCAAGTTCCCAAATTATCTGCAGA 
GCAATGAACTCCTCTTCTTCAGAAAGCAGGCTGAAAGATACACT 
GTTCACATCTTAGCCTGACCTGGACCCAGTGAGTTTCCATCAGT 
GAGAAAATTCTGTGCTAACTTGAGATAATACTATTCTTGTGGCA 
ATTTTACTTTTCCTTTGAGCGATTCCTTCAACCTCTCTCTGCCC 
CTTCATTTTTCCGTCTTAAAACTAAAAGTGCCCTTTCTCCCTGG 
ACACTCCTCATTTGCAATGAATTGTCATTTCAGCTCCTCAGTCA 
AGAGGAGTAATGAAATCCCACCCGTGTTAATCCTCTTATATCCC 
GCAGAAATATTGTAGACCCACTCACCCTAGGCAACAT 




KCPJ212 7 
75 


AGTAATGAAATCCCACCCGTGTTAATCCTCTTATATCCCGCAGA 
AATATTGTAGACCCACTCACCCTAGGCAACATGCCCTCTCTCTT 
CAACACAGGTCATCAATTGTTCATTTACTGGCTATCTCCATGTA 
CTGGAACTTCAGGGTGGTGTCCAGCTGGGTTCAAAGGAGAAACA 
GTGGGAAGTTTCTCCACTGCCACCTGAATTAGATGAGAAAGAGT 
TGTCTACTGAAATACACTAGCTGGTGGCAGGATTGGGACGTCAT 
TTGACTAATTGCCTCCTAGAGCTGCAGAGACTGCTGGAACTACC 
TAAGTAAATCATCAAAAAAAAAAAAAAAAAAAATCATCCCAGGG 
CACTTTTTCCAGACAAAAAGGTCCACTTAAAACATCCTCTAGAG 
ATCTGTGCCTGAAGCTGAGCTGCTGCAATGAAACTGACATTTCT 
GCCTTGCAGCCTGGCCATGGGCTTAGCTGGACTAAAATGCTGCT 
GCAGTGGTGAGGGCAC [A/G] TGAGAGTCCCTAATGTACATGGC 
CTTGCTCCTTGTCCTGACACATCTTTTAGGGCTGCTGCTTTCTC 
TAGTGCTGGAATCTAGATAATTCCTTTCCCAGCCGTTTGTTTCT 
TCAATCTTGGAAAATATCTGGATGAATGTAACACTGTCACACAC 
AAACAGAATTATGACTTACGTCACATTCTATGTCGTGATTTTGT 
GGACTTTTAATAATTGCATTACATTTGTGACCATTAATTTCCAC 
CATCGCCCTGCTCCTGAGAATCTGTAAGGGACATTTGACACTCC 
TCTCCCCACCCACCTCAACATTTGTGCTGACCTGAAGGTCACAT 
TAAAAACATACCCATTTGGAGAGAAAGATCTGTCTACTGAAATA 
CACTAAATATTGAAGAATTTCCAAGTCATTTGATCTTGAAAACT 
CCATCTAATGGAAGCAGAAA.CACTCAAAGGTTTTTTTTTTTGGA 
CTCCCTTTTTCAGGACACTTTCAGGACTGAGGTATAT 


SEQID 
NO. 223 


KCP_2217 
99. 


TCAGACTTTGAACAAACCTCAGAAGGAACTGTCAAGGAGGCTCC 
CCACGGGTTCACGCTCTTCTCTCCTCCTGCACACAGGGAACAGG 
GCCATTCTCCTTCCTTTACTGGGACTACCTGGGCTTCATCCAGG 
GAATCCCCAGGTGGCAACAGGAGGGTGGTGAAAACCGCTGCCCG 
TCACCTGTAAAGTTTCCTGTGAATGTGTCTACAGCGGCCAGCAC 
CACAAGGCATACAAAGAAAGGGAAGGGAGAGCTGATGTGAGAGC 
GGCAGCGTGGGCACTCCTGTGAGGTTGCCACAGCTGTAGACAAG 
TTAAATCAGTGCAGTTCAATCAAAAGTCATGACCCATGAGCGTC 
ACAAC CAGC ACGAGTCTACAAAGGAATACATTAAAACTAAGACC 
AGAGCACAGCTCACATTAGTGAGGGATGGGATCATTTCATGGAG 
TTTTTGTTTCAAAATATTTCATTAACATTTCACTTATATACATG 
TGTGTATACTGGGTTGTGAT [A/T] TAAATTACAATTCTTACTA 
TAAAATACAGCAAAAGAAAGAAGAAACAAAGAGAGGGCCACTGG 
TTTACCTAACATCCACAGGCAGGCTACTTCCCAGCATCTTGAGC 
CCCAAAGAAGTAAATTTCCTTCCACAACCGATGTTACCACAGCG 
TGACACTTAG CCAATGATGAAAACGAAAAACAAAACAAAAGCTT 
GGCAGTCAGTATCCAAATATGCAGATACTACAGAATCTGTTTGA 
TGTAGAAGTTGATCCTGCTACCCAGACAGCAAACAACTCATTTA 
TTAATAAAGTCCAGTTCCTCCTTAATGAAGTGGGTTTAATAGTT 
GATATCTCAATAATTACTTAGTGCATTTTTTATGAAGGTGATGG 
GAAACAAGTGCTGTTTCTTGAGTCGGAAAGAGTCTCTCAAGCTC 


SEQID 
NO. 224 



INSDOCID: <WO 2004041 193A2_I_> 



WO 2004/041193 



-166- 



PCT/US2003/034681 





CCACAAAGAAATTTCCCGAGCTTGTGAGGAATTCAGTCACAGGA 
AGATCAAGGAATT 




KCP_2235 
68 


GATCTAATGCTAGGAGATTCAAACCAACAATTAATTTCTCTGTT 

AAAATGGGTTAAAATAGATGTAAAATATTAATATGTATATAAGC 

ATTCTGAATTAGACTTATGTGAATTTTTCTCCTTTTCTTTCTTT 

CTTTTTGAGAATAAGCCCTTTCATTTACGTAGAAATGCTTCAGC 

GTTTAGATAATTGCTACTTATCTTGTTAGCTACAAACACAACCA 

TAATTAAAGGCTCTGTAAGAATTATGAATTCTGGGGAAATTGGC 

CACTTGTCTCTGTGGCGTAAACAGTATCTAATTTATAACAAATC 

ATCTGCCTTAGTCCCAGCAGGATAAGGTGATATGTATTGCCCAG 

CACATGAGAAAGATGGCAATTAGGAATTGTTACCAAGTTACGGG 

AGCCTCACACGAACATCCATCACCTTTGGGGATATGTACAAGAT 

ACAAACTTAATTTGATGGATTCCTTTTGTATTGGGATCAAAGTC 

TCAAAAGGGAAAGTGACAATTTCAGGGAAAATCTGGTGCAATGA 

GACCAACACTGATGAGAGAAATGCACACAATTTAATACACCTGC 

TCACCTGATGTGGCAACTCAGCCTGTGCTTGCTGTGGGTTGCCA 

CAGGATGAGACATGGTCTGTGCATATTCCCAGCAGCCACCCATC 

TCATCACTATTCTTGCCAGCCCAGATTTACAGTTGTTCAATAGA 

TGGATTTGGTAATATCTGCATGACAACAACAGGCAGAGAAGGTT 

AGATGGCAATTGATTCTTGATTGGTGTAAGTTTATAGAACACAT 

TCTGGCAGGGCCCJ\AAGGAAATCACTCACCTACCCCTCTGTGAT 

GGTAAAACGTTGAAAATTCCACGGACTTGGACCTTGTGATCCTT 

CAGTGGAAGATGGGCAGATTCCTTGCTTTAATTGACAGACACTT 

TCTAAATAACTAATGCAATCTTATATTACATTATAGTCCATAAG 

GGAGACATACTTAAACTACTACTTACAACAAC [G/T] GTTTTTA 

GAGCCTTTCAAATGGTTTGTACAAAGTAGCTCCCATTTAAGATA 

TTTTCCTAGTATTTAAGGCTATCTAGTAGACATTACAAAACAAT 

ACGCTGTAAATACATTCAGATTTTTATCAGTAATACTTAACATG 

CCGTAATTTGAACTTTCTGCTAAATCATGCTATCCATTCCTAGT 

TGGCCCCAATGGTGAGAGTTTACTGTTTCTTTAAATAATTTTGT 

TTCCCTTTGCTGTCTAGAGGTGTTTATCATTCTGCTTACTTGCC 

TGTGTCTCTGGAATATTCAGAAGGTTCCATGGGAAACAATTTGA 

ATATGCAAAGAAGTTATTTTTAAAGCAAGGAAAATGTTTTCATA 

TGGATTTATTTTGAGCACTTCTGCCTTTGCCTCCACTGGGAACA 

TGTTTCTCTCCAACGCCGAAGCCCCCTCCCTGTGTGGTGTTTGA 

CGCAGAGGCTGACAGGGCAGGGAAGTGGGGTTCAAGATAGGAAG 

GCCATTGGCAGTGTGACCCCAGCCCACAGTCCTAGATCCCAGGT 

CGTGACAC CACTCTTTTGACAGCC CAG ATTGTTACCTAACAAG A 

ATGACTCCCAAGCTCAACCATTCCAATGCCATCTCCTCTGGTTC 

CAGATAAGATTGAAGATGAGCTGGAGATGACCATGGTTTGCCAT 

CGGCCCGAGGGACTGGAGCAGCTCGAGGCCCAGACCAACTTCAC 

CAAGAGGGAG CTGC AGGTC CTTTATCG AGGCTTCAAAAATGTAA 

GACCCGTGCACGCTCTGAAGGCCTGGGGGGGGTTCCCACGTGAG 

GCTACACTCTCCCCAATGCCAAGGGAGCTCATAAGGCGTTTCCC 

ATATGTGAGGCTGTACAAGGAAGGCCAGCTCTATAAAGGGGGCA 

TG AGAGGGAGATCACCTGGCTAGAAAGGAAGG CTCCAGGCGAGG 

ATGGAGCAACCTCAGGAGACAGTAAACGGCCAACTGCCCAGAAA 

TTTCACAGGGTGGCACATCCTCAAG 


SEQ ED 
NO. 225 


KCP_1152 


GATTTTTATCAGTAATACTTAACATGCCGTAATTTGAACTTTCT 
GCTAAATCATGCTATCCATTCCTAGTTGGCCCCAATGGTGAGAG 
TTTACTGTTTCTTTAAATAATTTTGTTTCCCTTTGCTGTCTAGA 
GGTGTTTATCATTCTGCTTACTTGCCTGTGTCTCTGGAATATTC 
AGT^AGGTTCCATGGGAAACAATTTGAATATGCAAAGAAGTTATT 
TTTAAAGCAAGGAAAATGTTTTCATATGGATTTATTTTGAGCAC 
TTCTGCCTTTGCCTCCACTGGGAACATGTTTCTCTCCAACGCCG 
AAGCCCCCTCCCTGTGTGGTGTTTGACGCAGAGGCTGACAGGGC 
AGGGAAGTGGGGTTCAAGATAGGAAGGCCATTGGCAGTGTGACC 


SEQ ED 
NO. 226 



tNSDOCID: <WO 2004041 1 93A2J_> 



WO 2004/041193 



-167- 



PCT/US2003/034681 





CCAGCCCACAGTCCTAGATCCCAGGTCGTGACACCACTCTTTTG 
ACAGCCCAGATTGTTACCTAACAAGAATGACTCCCAAGCTCAAC 
CATTCCAATGCCATCT [C/T] CTCTGGTTCCAGATAAGATTGAA 
GATGAGCTGGAGATGACCATGGTTTGCCATCGGCCCGAGGGACT 
GGAGCAGCTCGAGGCCCAGACCAACTTCACCAAGAGGGAGCTGC 
AGGTCCTTTATCGAGGCTTCAAAAATGTAAGACCCGTGCACGCT 
CTGAAGGCCTGGGGGGGGTTCCCACGTGAGGCTACACTCTCCCC 
AATGCCAAGGGAGCTCATAAGGCGTTTCCCATATGTGAGGCTGT 
ACAAGGAAGGCCAGCTCTATAAAGGGGGCATGAGAGGGAGATCA 
CCTGGCTAGAAAGGAAGGCTCCAGGCGAGGATGGAGCAACCTCA 
GGAGACAGTAAACGGCCAACTGCCCAGAAATTTCACAGGGTGGC 
ACATCCTCAAGGAATTCACCCTGGCCCAGGGTCAAGCCTTAGCC 
CTTAACATAATCATACCTTCCAACCTGGTGGTGCCCCCACAATA 
ATGGGATTTGGCCCTGCTGACTTATGCTAACCAGGCT 




KCP_13 33 


GGCAGGGCCCAAAGGAAATCACTCACCTACCCCTCTGTGATGGT 

AAAACGTTGAAAATTCCACGGACTTGGACCTTGTGATCCTTCAG 

TGGAAGATGGGCAGATTCCTTGCTTTAATTGACAGACACTTTCT 

AAATAACTAATGCAATCTTATATTACATTATAGTCCATAAGGGA 

GACATACTTAAACTACTACTTACAACAACTGTTTTTAGAGCCTT 

TCAAATGGTTTGTACAAAGTAGCTCCCATTTAAGATATTTTCCT 

AGTATTTAAGGCTATCTAGTAGACATTACAAAACAATACGCTGT 

AAATACATTCAGATTTTTATCAGTAATACTTAACATGCCGTAAT 

TTGAACTTTCTGCTAAATCATGCTATCCATTCCTAGTTGGCCCC 

AATGGTGAGAGTTTACTGTTTCTTTAAATAATTTTGTTTCCCTT 

TGCTGTCTAGAGGTGTTTATCATTCTGCTTACTTGCCTGTGTCT 

CTGGAATATTCAGAAGGTTCCATGGGAAACAATTTGAATATGCA 

AAGAAGTTATTTTTAAAGCAAGGAAAATGTTTTCATATGGATTT 

ATTTTGAGCACTTCTGCCTTTGCCTCCACTGGGAACATGTTTCT 

CTCCAACGCCGAAGCCCCCTCCCTGTGTGGTGTTTGACGCAGAG 

GCTGACAGGGCAGGGAAGTGGGGTTCAAGATAGGAAGGCCATTG 

GCAGTGTGACCCCAGCCCACAGTCCTAGATCCCAGGTCGTGACA 

CCACTCTTTTGACAGCCCAGATTGTTACCTAACAAGAATGACTC 

CCAAGCTCAACCATTCCAATGCCATCTCCTCTGGTTCCAGATAA 

GATTGAAGATGAGCTGGAGATGACCATGGTTTGCCATCGGCCCG 

AGGGACTGGAGCAGCTCGAGGCCCAGACCAACTTCACCAAGAGG 

GAGCTGCAGGTCCTTTATCGAGGCTTCAAAAATGTAAGACCCGT 

GCACGCTCTGAAGGCCTGGGGGGGGTTCCCAC [A/G] TGAGGCT 

ACACTCTCCCCAATGCCAAGGGAGCTCATAAGGCGTTTCCCATA 

TGTGAGGCTGTACAAGGAAGGCCAGCTCTATAAAGGGGGCATGA 

GAGGGAGATCACCTGGCTAGAAAGGAAGGCTCCAGGCGAGGATG 

GAGCAACCTCAGGAGACAGTAAACGGCCAACTGCCCAGAAATTT 

CACAGGGTGGCACATCCTCAAGGAATTCACCCTGGCCCAGGGTC 

AAGCCTTAGCCCTTAACATAATCATACCTTCCAACCTGGTGGTG 

CCCCCACAATAATGGGATTTGGCCCTGCTGACTTATGCTAACCA 

GGCTCACCGAGACTGATGTGTAAGCCGAATGTCGGTGTATTAAT 

TTAC CTTGGGAAATGG AACTGACAGTGGAAACAG AC ACTC CTCT 

CCCTTCGCTGGGACCCGCTCTCCTTGGAAGCCACATGGAAGCCA 

GGTTACAATCAAAAGTGGAGTCAGAGGACGGGAGTTCCTTGTTT 

AGTTGTTACTTTAAATACATTAATGTGTTCCTGCAGTCTCAGGC 

CAGTTTGAGAGCTCTCAGATACAATCCTGGATATTAATTTATTT 

TTTAAGTTTAACTCTC7VGAGTGCAATCTTATTCCCAAATCCTGG 

AGTGGTGTGGAGTGGGGTGGGCTACAGCGACATGCACCTGGTCA 

CCCTCCCTCCAGGTGCAGTCTGTAGGTAGAGCTGAGCTGGGTCA 

GTTCCAAACTGACCACAGCCTCAATGTTCTCCAAACTGCTGACC 

CACAGGGATTCCAGCCCCTCCTGGGAGTTATCTGACAGGTGCTG 

GGATGCCTCTTCCTTCCACACTAGCCTTGACTGCACATGCCAAG 

TGCCCAGTTTCCTACCATTAGGGCTTCTTTCCTTCGATGGCAGC 


SEQDD 
NO. 227 



INSDOCID: <WO 2OO4041 1 93A2_I_> 



WO 2004/041193 



-168- 



PCT/US2003/034681 





ATTAGCAGTGGGCAGCCGAGTTGGAGAAGGATCCTGTGGGAAAG 
TTTTCCAGGCAGGCACTGGGCTCAGAGGGAACAGCATCCAGAAA 
AG AGAAGAAATCTACACTG CTTGGC 




KCP_2252 
20 


AATTTACCTTGGGAAATGGAACTGACAGTGGAAACAGACACTCC 
TCTCCCTTCGCTGGGACCCGCTCTCCTTGGAAGCCACATGGAAG 
CCAGGTTACAATCAAAAGTGGAGTCAGAGGACGGGAGTTCCTTG 
TTTAGTTGTTACTTTAAATACATTAATGTGTTCCTGCAGTCTCA 
GGCCAGTTTGAGAGCTCTCAGATACAATCCTGGATATTAATTTA 
TTTTTTAAGTTTAACTCTCAGAGTGCAATCTTATTCCCAAATCC 
TGGAGTGGTGTGGAGTGGGGTGGGCTACAGCGACATGCACCTGG 
TCACCCTCCCTCCAGGTGCAGTCTGTAGGTAGAGCTGAGCTGGG 
TCAGTTCCAAACTGACCACAGCCTCAATGTTCTCCAAACTGCTG 
ACCCACAGGGATTCCAGCCCCTCGTGGGAGTTATCTGACAGGTG 
CTGGGATGCCTCTTCCTTCCACACTAGCCTTGACTGCACATGCC 
AAGTGCCCAGTTTCCT [A/G] CCATTAGGGCTTCTTTCCTTCGA 
TGGCAGCATTAGCAGTGGGCAGCCGAGTTGGAGAAGGATCCTGT 
GGGAAAGTTTTCCAGGCAGGCACTGGGCTCAGAGGGAACAGCAT 
CCAGAAAAGAGAAGAAATCTACACTGCTTGGCATCTACCATGGA 
CTCAATAC CACCTAACATAGGTTCATAAGATAC CCTTGGGGAAG 
TTATTGTTACCCCCATTTTACAGGTAAGGATATTGAGGATCAGA 
GACTGGCTTGGCCAAAGTCACAAAGCTTAGTATTGGCTGAGCCA 
GGATTTAAACCCAGGTTTTTCTGATCTTAAAGCCCCAAATCTCT 
CCACCTCACAGTGCCCATTCTCTGACAATGTCTCATCATTTTGC 
AAAGCAGCTCCAGTCCTGAGATGGCACTACTTGGGAGAAGTGGA 
AATGCACAGGTCCCTGTCCCTGGGGATCATGAGGAACCCCAGAC 
ACCAAGGCTGGGCCCAGTCTTCTCCTAGTGCTGGCCC 


SEQID 
NO. 228 


KCP_2649 


GGCTCACCGAGACTGATGTGTAAGCCGAATGTCGGTGTATTAAT 
TTACCTTGGGAAATGGAACTGACAGTGGAAACAGACACTCCTCT 
CCCTTCGCTGGGACCCGCTCTCCTTGGAAGCCACATGGAAGCCA 
GGTTACAATCAAAAGTGGAGTCAGAGGACGGGAGTTCCTTGTTT 
AGTTGTTACTTTAAATACATTAATGTGTTCCTGCAGTCTCAGGG 
CAGTTTGAGAGCTCTCAGATACAATCCTGGATATTAATTTATTT 
TTTAAGTTTAACTCTCAGAGTGCAATCTTATTCCCAAATCCTGG 
AGTGGTGTGGAGTGGGGTGGGCTACAGCGACATGCACCTGGTCA 
CCCTCCCTCCAGGTGCAGTCTGTAGGTAGAGCTGAGCTGGGTCA 
GTTCCAAACTGACCACAGCCTCAATGTTCTCCAAACTGCTGACC 
CACAGGGATTCCAGCCCCTCCTGGGAGTTATCTGACAGGTGCTG 
GGATGCCTCTTCCTTCCACACTAGCCTTGACTGCACATGCCAAG 
TGCCCAGTTTCCTACCATTAGGGCTTCTTTCCTTCGATGGCAGC 
ATTAGCAGTGGGCAGCCGAGTTGGAGAAGGATCCTGTGGGAAAG 
TTTTCCAGGCAGGCACTGGGCTCAGAGGGAACAGCATCCAGAAA 
AGAGAAGAAATCTACACTGCTTGGCATCTACCATGGACTCAATA 
CCACCTAACATAGGTTCATAAGATACCCTTGGGGAAGTTATTGT 
TAC C CC CATTTTAC AGGTAAGGATATTGAGGATCAG AG ACTGGC 
TTGGCCAAAGTCACAAAGCTTAGTATTGGCTGAGCCAGGATTTA 
AACCCAGGTTTTTCTGATCTTAAAGCCCCAAATCTCTCCACCTC 
AC AGTGC C CATTCTCTGACAATGTCTCATCATTTTGCAAAGCAG 
CTCCAGTCCTGAGATGGCACTACTTGGGAGAAGTGGAAATGCAC 
AGGTCCCTGTCCCTGGGGATCATGAGGAACCC [C/T] AG AC AC C 
AAGOCj.C3rGGCCCAGTCTTCTL.L, 1ACj1vjU±UoUUU1L~A/^±<jCL. X 
CCCGCTGACTCTCTCCCCTTCCCACAGGAGTGCCCCAGTGGTGT 
GGTCAACGAAGACACATTCAAGCAGATCTATGCTCAGTTTTTCC 
CTCATGGAGGTGAGTCTGACCTTGAAATCTATCTTGCCCAGCTC 
CCTCTCTGGTAAGCAGCCTTCCCTTCCTCCAAGTCCTCTCTTCC 
TTGCCATTTGCTTCCTTCTCGAGGAAGAGACAAACTCAGGGCAG 
GACACCTCCCTCATCGTGAGAGGTGGGAGTCTCCAAAGCTTTAG 
CAGGAAAGAACTCTGAAAATGAACCCACCCTGGAAGGGGAAGAA 


SEQID 
NO. 229 



BNSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-169- 



PCT/US2003/034681 





GGGCTGATAATGCAACATCACAACGTCTCAGAACAGCTCTAGAA 
AGCAGGTATTATAATCCCAGATGGAGTAACTGAGTTTCGGGGAA 
GATAAGCAGTGTACTCAAGATTGCACAGCTGGTGAGTAGCAAAC 
CAGGATTAGATTCCATAAGGGTCTGAAACAGGTTTTGCCATGCT 
GGCACCACCATTGTGCAGGGCACTTTTGAATCTTTTCCTTAAAA 
TAGCTGAGACAAGCTGGAATTTTGTAAAAGAACTTGAGTAAATA 
CCGAAGACTATAAAAATAAACTAATTGAAAAAGAGGCAGGAAAC 
ATAAAGTTGTGCTTATTAAGCCAGTTTACAAGTGTGCCAGGCCC 
ACAACAGCTGCTCTGTTGCCCTGCCCGACTCCTGTGGGAACCAG 
CTGTGTCCCCATGGGCCTGGGACCACATCGGTGACTCCTCCTGT 
GGCCTCCATGTGTCACATGCCACTTTGCATCCTGTCACCAAGAG 
CTGTCTCCTGCAAGACATCTTCCCTGGATCCTGACAAAATGCAA 
ATCCAAGTATTCCAAACACTTCTTGGGCCCTGTTTCTCATGGGC 
CTTTTTGGCAGCAGACAGATGCCTTCCTTGGTGTGTGGGGCCCC 
TACC CAG ATCAGGTGGGGGAGGC AG 




KCP_22 7 8 
71 


CCTCTGGTTCTGCATCACCTCCCCCTCTAAATCTCAAGGCATTG 
GGGGAAGGTCTGGACCATCAAAAGCTCTCAGTCAGACCAAAGAC 
ATGTTTATCCATTTGTAAGCATTTCCTAAAGATGGGGAAAAGCA 
GCAGCAACTTTCCCTGGCCTGCAGGAACTCAGGGACTCAGGGGA 
CTAATAACAACAGTGTATGAGCTTCCGGGCACACTGCTTCCCAG 
TGGCAGCCCCTGTACTTAGGGCTTTGTATGTATTAATTCATTTA 
CTCCAATTC C C ACAATAACCCTATAGGGTAGGGTTTTATTATTG 
ATTACCTTTTTACAGAAGAGGAGAGTAAGGCAAAGAGAGATAGA 
GTAGTTTTCCCAAGGTCAAAGAGCACATAAATGATAAAGGATGG 
ATTTGAATGTAGGCAGAATGACCCTCAATACAGACTGTTCCTAC 
AGTCC ACGTC CTCAGCCACTAGACCATACGGC C ACTGGGATGAT 
AG AC AG AC C ACTG CAG [C/G] CATGGATAAGGCAAAAACAGGGC 
TGGCTGTGTTGATCTGTGTCTCTCAGAGCTCCATTCTTCCTCAA 
GGGGGCACCTTGCAAAAAAAAACAAAAAAATGGGGCAGGGTAGG 
GAACTGAAGGCAGGAGCTCTTCACAGAGCATAG CCACATCCTC C 
AGGCAGACAAGAGGACGCAGGAGGCACCATTCTGTGAGAGTATC 
ACAGTCTGACCCAAAGACAC^lGCTTCACACTGTCTGATGGCTTG 
ATGGTTAATGTCACTCTGCCTTTTCCCCTTCTCAGGACTTTGTA 
ACCGCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAGAAACT 
AAGGTGGACATTTAATTTGTATGACATCAACAAGGACGGATACA 
TAAACAAAGAGGTAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGC 
TCC AGTGAAGGT AACTAACC CAACAGAAAAC AGCCC C AGGCATG 
AGGATAGCAC TGTCTGAATGAGG CAGGCTCTGCTTTG 


SEQ ED 
NO. 230 


KCP 2279 
87 


TGTGCCATTCATACACCAACGACTCCATGCATAGACAGGCAGGA 
GAATGGTTTTCTCATGATGGCTAGAGGGAGGGGCAAGGGCTCAT 
CTCACTTTTTGCTAGATCTAACTTCACAC C CAAACCCAAAGAGT 
TGAGTCAATGGGCCCCACTCCATAATTTTCTCCTTTCCATCACC 
CTAGCATCACTCTCCTCTCTTTCTTGTCGAAGCCCTGCCTTGTT 
TGGAAGGTTCTCCCTGTGTGGAATTCCTGCCCCCATCACCTGCC 
CTCCTTTTCTGCCTTGTAGATGCCAGCACGTATGCCCATTACCT 
CTTCAATGCCTTCGACACCACTCAGACAGGCTCCGTGAAGTTCG 
AGGTACGCTCATCTGGGGTCCACTCTAGGGGTCCTCTGGTTCTG 
CATCACCTCCCCCTCTAAATCTCAAGGCATTGGGGGAAGGTCTG 
GACCATCAAAAGCTCTCAGTCAGACCAAAGACATGTTTATCCAT 
TTGTAAGCATTTCCTAAAGATGGGGAAAAG CAGCAG CAACTTTC 
CCTGGCCTGCAGGAACTCAGGGACTCAGGGGACTAATAACAACA 
GTGTATGAGCTTCCGGGCACACTGCTTCCCAGTGGCAGCCCCTG 
TACTTAGGGCTTTGTATGTATTAATTCATTTACTCCAATTCCCA 
CAATAACCCTATAGGGTAGGGTTTTATTATTGATTACCTTTTTA 
CAGAAGAGGAGAGTAAGGCAAAGAGAGATAGAGTAGTTTTCCCA 
AGGTCAAAGAGCACATAAATGATAAAGGATGGATTTGAATGTAG 
GCAGAATGACCCTCAATACAGACTGTTCCTACAGTCCACGTCCT 


SEQ ID 
NO. 231 



JNSDOC1D: <WO 2004041 193A2J_> 



WO 2004/041193 



-170- 



PCT/US2003/034681 





CAGCCACTAGACCATACGGCCACTGGGATGATAGACAGACCACT 

GCAG CCATGGATAAGGCAAAAACAGGGCTGG CTGTGTTGATCTG 

TGTCTCTCAGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCAAA 

AAAAAAC AAAAAAATGGGG CAGGGTAGGGAAC [C/T] GAAGGCA 

GGAGCTCTTCACAGAGCATAGCCACATCCTCCAGGCAGACAAGA 

GGACGCAGGAGGCACCATTCTGTGAGAGTATCACAGTCTGACCC 

AAAGACACAGCTTCACACTGTCTGATGGCTTGATGGTTAATGTC ' 

ACTCTGCCTTTTCCCCTTCTCAGGACTTTGTAACCGCTCTGTCG 

ATTTTATTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATT 

TAATTTGTATGACATCAACAAGGACGGATACATAAACAAAGAGG 

TAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGCTCCAGTGAAGGT 

AACTAACCCAACAGAAAACAGCCCCAGGCATGAGGATAGCACTG 

TCTGAATGAGGCAGGCTCTGCTTTGGGGCTAACAGAGCTGGTCC 

CTGGCAAAATAAAGAAGGCCTCCCTCATTGCCCTACCCTGCCCT 

GTTCCCAAGCGCCCAGAAAGGATTAAACAGATTCATTCTCACTG 

GGTCACCTAGATTCAGTAGATATTACACAGTGGATAAAAATGAC 

TTGTTTCAGTGTGAAGAGTTACTCTTCCCTAGGGAACCTGCATT 

TGGGAAGGTTAGGAGCCACAAGTCAAAGCTAAAAGTTGAAATGG 

TGGAATTGTAGGCAGCACCTAGAATAGAAAAGAAAGATTTTTAA 

GGAAGAGGAACCTACAATTGGGTCATATTGGCCTTAAACTATTT 

TGCCTATTAATACAACCGCCAAGGGGGTAATGGAAGGTACAGCT 

GTCTTTACAGAAATTATCACAAATAATTTCTGAATCTTCACTGC 

TTTGCACTTTTAGAACCTCAGAGGACATGTCTCTAGCCAGTGAA 

ATACCCTCAGGTCTATCTCAAAACTCACTTTGGTATCCACTGTA 

TC CTGGTATCTCAGTGGAAGCTGGAAATTGG CATC CTGTAACAC 

TCCACTTGCTGAGCTCCTGTGTGCCAGGCACGGTGCCTGGAGGT 

ATAGATATCAGCACCAATCTTCACC 




KCP_22 81 
07 

** 


TAGGGCTTTGTATGTATTAATTCATTTACTCCAATTCCCACAAT 
AACCCTATAGGGTAGGGTTTTATTATTGATTACCTTTTTACAGA 
AGAGGAGAGTAAGGCAAAGAGAGATAGAGTAGTTTTCCCAAGGT 
CAAAGAGCACATAAATGATAAAGGATGGATTTGAATGTAGGCAG 
AATGACCCTCAATACAGACTGTTCCTACAGTCCACGTCCTCAGC 
CACTAGAC CATACGGC C ACTGGGATGATAGACAGACCACTGC AG 
CCATGGATAAGGCAAAAACAGGGCTGGCTGTGTTGATCTGTGTC 
TCTCAGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCAAAAAAA 
AACAAAAAAATGGGGCAGGGTAGGGAACTGAAGGCAGGAGCTCT 
TCACAGAGCATAGCCACATCCTCCAGGCAGACAAGAGGACGCAG 
GAGGCACCATTCTGTGAGAGTATCACAGTCTGACCCAAAGACAC 
AGCTTCACACTGTCTG [A/T] TGGCTTGATGGTTAATGTCACTC 
TGCCTTTTCCCCTTCTCAGGACTTTGTAACCGCTCTGTCGATTT 
TATTGAGAGGAACTGTCCACGAGAAACTAAGGTGGACATTTAAT 
TTGTATGACATCAACAAGGACGGATACATAAACAAAGAGGTAAG 
TGAGCTGGGGCCAGGGGTGTGAGAGGGCTCCAGTGAAGGTAACT 
AACCCAACAGAAAACAGCCCCAGGCATGAGGATAGCACTGTCTG 
AATGAGGCAGGCTCTGCTTTGGGGCTAACAGAGCTGGTCCCTGG 
CAAAATAAAGAAGGCCTCCCTCATTGCCCTACCCTGCCCTGTTC 
CCAAGCGCCCAGAAAGGATTAAACAGATTCATTCTCACTGGGTC 
ACCTAGATTCAGTAGATATTACACAGTGGATAAAAATGACTTGT 
TTCAGTGTGAAGAGTTACTCTTCCCTAGGGAACCTGCATTTGGG 
AAGGTTAGGAGCCACAAGTCAAAGCTAAAAGTTGAAA 


SEQID 
NO. 232 


KCP 2325 
21 


ATTTCTTAAAGTAGATAAATTTGACTTTATCAAAGTTAAAAATT 
TTGTGCTTTAGAAGACACCTTTAAGAAAATGGAAATGCAAGCCA 
TGGACTTGGAAAAAATGTTTGCAAATTATATACCAGATATATAA 
AGATAC CAGGATACCAAAC CAATATAAAG ACTGGC ATCC AAAAT 
ATATAAGGGACATTTATAATTTAATACAAAGATAAACAACTTCA 
TATAAAATAGGCAAAAGATTTGATGAGATATTTAAGAAAAGAAG 
ATATATGAATGGCCAGTAAACCCATGAAAGGTTGCTCTATATCA 


SEQID 
NO. 233 



3NSDOCID: <WO 2004041 1 93A2J_> 



WO 2004/041193 



-171- 



PCTYUS2003/034681 





CTGGTCTTCAAAGAAATGCAAATTATAACTATAATGAAATACAA 

TTGCACAGAATGGCCACAATTAAAAAGACTGATAATACCAAGCA 

TTGGCAAAGATGTGGAGCAATAGAAACTCTCATAGATAGCTGGC 

AGAAATGTAAATGGTACAAACACGTTGGGAAACATTTTGGCATC 

TTTGATAAAGCTCAGCACACACTTAACATACAACCCAGAAATCC 

CATTC CAGTCAGGCATGGTGGCTT ACG CCTATAAT C C CAGTACT 

TTGGGAGGCTGAGGCAGGCGGATCACTTGAGCTCAGGTGTTCAA 

GACCAGACTGGGCAACATGGCGAGACACTGTCTCTACTAAAAAT 

ACAAAAAAAAAAAAAAAAAAAGCCAGACATGGTGGTAAGCACCT 

GTGGTCCCAGCTACTAGGGAGGCTGAGGTGGGAGAATTGCTTAA 

CCCTGGGGAGTGGAGGTTGCAGTGAGCTGAGATTGCACCACTGC 

ACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAAAAGA 

AAAAAAGAAGAAGAAAAGAAGTC C CACTCCTGGATATTTACC CC 

CAAAAGAAAAATATGTAATTC CATAAAGACTTGTACAAAGATGT 

TCATAGCAGCTTTATTCATAGTAATCTCAAAACTTAAATGACCC 

AAATGTCTGTCAACAGGACAATGGGTAAATAC [A/T] TCATAGT 

CTGTTCATCCAATGGAATATTACTCAGCAGTAAAAAGGAATGTT 

ATAGTTGCATGCAGCAATGTGTATGAAGCTCATAAACCTCATGC 

TGAGTAAATGAAGCCAGACGCAAATGAGTTTACACTGTTTTACT 

CCATTTACATGAGATTTTAGT^AAATACAAACTAATCTATAGTAA 

CAGAAATTAGATCTGTGGTTGCCTGGTGTCAAAGCTTGAGAGGC 

ACTCACTGCGAAGAAGTGTGAAGGGATGTCTTTTGGTTGTGAAA 

ATGTTCTATATCTTGAGTGTGGTGGAGGTTACATGGGTGGATAC 

ATTTGTCAACATTCATCAAACAGTACACTTAAAATGGGTGAATT 

TGTTATAAGTAAATTATGCTCCAATAAATTTGATTTATTTGTTG 

AAAAACTTGGTGTAAGGGGGAAGTGCCTAACCAATAGAAGACAC 

TCAAAAAATGTGTTGAAGGAAAAAAATCCTGTGAAATAAAGCAG 

GTAAGAGAAAATAAGAACTCAATATCATCCAAAATATAGATTAC 

AAATCCTAAATGAGATAATAGGAAATTAATCCCAGTGCTCTGTT 

TAAAGGCTCATACCTGTAATCCCAACACTTTGGGAGACTGAGGC 

AGG AGGATGGGTTGAG C CCAGGAGTTCAAGACCAGCC'TRfiTr 7\ a 

CATAGGGAGAGCCTGTCTCTTCAAAACAAAAATTTAAAAATTAC 
CTGGGTGTAGTGGCACGTGCCTGTGCTCCCAGCTACTCCAGAGG 
CTGAGGCAGGAGGATAGCTTGAGCCCAGGAGTTCAAGCCTGCCC 
TGAG CCATAATCACTGCACCACACTCCAGCCTGGG CAAC AGAAC 
AAGACCCTTCCTCAAAAAAGCAATAAAATAAAATAAAGAAATGC 
ACATGACTAACATAGGGTTTATTCCAGGAATGCAGGAATAGCCC 
AGTAG CAGAGAAAGC CTATTAAATAATTTATCACATTAATATAT 
CAAAAGATCAAACCATTTGATGCTA 




KCP_23 36 
55 


TTTACTCCATTTACATGAGATTTTAGAAAATACAAACTAATCTA 
TAGTAAGAGAAATTAGATCTGTGGTTGCCTGGTGTCAAAGCTTG 
AGAGGCACTCACTGCGAAGAAGTGTGAAGGGATGTCTTTTGGTT 
GTGAAAATGTTCTATATCTTGAGTGTGGTGGAGGTTACATGGGT 
GGATACATTTGTCAACATTCATCAAACAGTACACTTAAAATGGG 
TGAATTTGTTATAAGTAAATTATGCTCCAATAAATTTGATTTAT 
TTGTTGAAAAACTTGGTGTAAGGGGGAAGTGCCTAACCAATAGA 
AGACACTCAAAAAATGTGTTGAAGGAAAAAAATCCTGTGAAATA 
AAGCAGGTAAGAGAAAATAAGAACTCAATATCATCCAAAATATA 
GATTACAAATCCTAAATGAGATAATAGGAAATTAATCCCAGTGC 
TCTGTTTAAAGGCTCATACCTGTAATCCCAACACTTTGGGAGAC 
TGAGGCAGGAGGATGGGTTGAGCCCAGGAGTTCAAGACCAGCCT 
GGTCAACATAGGGAGAGCCTGTCTCTTCAAAACAAAAATTTAAA 
AATTACCTGGGTGTAGTGGCACGTGCCTGTGCTCCCAGCTACTC 
CAGAGGCTGAGGCAGGAGGATAGCTTGAGCCCAGGAGTTCAAGC 
CTGCCCTGAGCCATAATCACTGCACCACACTCCAGCCTGGGCAA 
CAGAACAAGACCCTTCCTCAAAAAAGCAATAAAATAAAATAAAG 
AAATGGACATGACTAACATAGGGTTTATTCCAGGAATGCAGGAA 


SEQID 
NO. 234 



5NSDOC1D: <WO 2004041 1 93A2_L> 



WO 2004/041193 



-172- 



PCT/US2003/034681 





TAGCCCAGTAGCAGAGAAAGCCTATTAAATAATTTATCACATTA 

ATATATCAAAAGATCAAACCATTTGATGCTAAAATCACATTTGA 

TATAATTTACCATTTATTCATAATAATTTTCAGGATTCAATTAA 

TTAGGAATAAAATACTTCTTCAGCATAATAGAAAATACCCCAGC 

CTGGTACACAGGTTCATACTTTATGGTAACAC CA/G] CGGAGAT 

TCTCACTGAAGAAAAGATGAGGCAAGAAAAGATGATGAAGAAAA 

GATGAGGCAAGAAAAGATGATGTCTGCACACTGTCAGACATCAC 

CACTGTTTAACATTTCCTGAAAGCTCTTCAAACACAGTGAAACA 

GAAAAGGAAATG CGATCTAAATAGGAAAAATTACAACATTCCTT 

GTTAATGACATGATTTTCTATCTGAGAAAAAAGACAGCAAGAAA 

ATCAACTTAAAACAACTAGAACTTTTAAAAAGCTGGCAAAGTGA 

CTGGTAATAAAATACATATGCAAAAAGAAATTGTGTAGCC^TA 

TATCAGTTGTGACTAGCTAGAAAATTGTAATACT^AATATTCTCA 

TTGTGATCACAATAAAATTTAAAGCACATGGGCATTTTTAAATA 

TCCATAATTTAGATGAAGAGAAAGAAAATTTTGATAAGTAGAGA 

AACATACCATCTTCTGAAAGGATGTATATTATAAAGATAGCAAT 

ATTATAATGACAGCAATTCTTCTCTAATTAAATTTATTTTATTT 

TGAATCAAAATGGAAGTGTTATTTGGGAAGGAAATTTGGCACAA 

TTGTTATAAAGTTACATTGGAAGATTAATCAGATGAAAATAGCA 

AAGATAATTTTCAAAAAGAAGAAAAATGGTGGGATTTGTTCTAC 

CAGATACTGAAATATATTATAAAGCTGAAACTATTAAAATATTA 

TAATATCAGAGAAGGAACAGGTAGATCAATGGAACAAAATAGAA 

ATCCCAGGTACAAATACCATCTTGGTTCATAATAAAGGGAGGAT 

ATTGAATAGAGAGGTAATGAATCATTAAATGATTCTTGGAAAAC 

TGGTTAACTATTTTGGCAATAAGTAAGTAAATATTCTTACTCGG 

TACCATAAACACAAAATCACTATAGATATGTACAGTTGCTTTTT 

AACTAAAAAAGAACTAAAAATCATATGTGAATATCTGATCAAAG 

AATGGAAAAAGCATAAAATCAAAGT 




KCP_23 75 
05 


GCCTGTAGTCCCAGCTACTTGAGAGGCTGAGGCGGGAGGATCAC 
TTGAACCCGGGAGGTCGAGGCTGCAGTGACGGGGATTGTGCCAC 
TGCACTCCAGCCTGGGTGACAGAGCAAGAACCTGTCTCAAAAAA 
AAAAAAAAAGAAAAAAGAAAAAAAGAATGAGAAACTCZATACAGA 
TTAGAAGAGACTAAGGAGACACAACAAATAAATGCAATGTAGAA 
T CATTGAAGGGAAAAAAATATTAGTTGAAAAGCTG AGATCCCGC 
CACTGCACTCCAGCCTGGGCCACAGAGCGAGACTCCGTCTCAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAGCTGATAAAA.TT 
TGAATAAGCCCTGTAGTTTAGTTAATAATAGTGAAGCCATGTTA 
ATTTCCTGGGTTTGGTCATTGTGCTCTGGTTATGCAAGTTGTTA 
ACATTAGAGGAGACTGAGTGAAAGGTATGCATGAACTCTCTGTA 
CTAATTTTGTAAATTT [C/T] CTGTAAGTCTAAAATTATTCATA 
ATATGCAAAAATTAAACAAAAAATAAAATT^AAATAAGCACATGG 
AATGAGACTGTCCCCTGGGTCTCTGTAGAAACCAGGTCAAACAT 
CCCAAATGCTCTTTTACCCCCATTCTGAGTTGGGCCAGAATGGT 
CAGAATAATGGTTCCCAATGTACCTTGATAAACACGGAAACTCT 
CAGGACCGAGTCCTAAGGTTCTCTGATTCAATAGGTTTGGAGTG 
GACTTGAGAACTGATCTTTTTAATAAGGGCCTCAGTCTGTGGAA 
CTATTGGCCTCATGTGCCCTGT.GGATAATCTTGGCTGTTGGTTC 
ATTTTTCTTAACTGAAAACAGTGGCAGAAACTATGGGGATTTTT 
AAATCTCTAGGCTAGAACATTAACTTTTTAA7^AATTCAGAATAG 
TATTTTATTTGCCTCAAGCCTGTGAATGGGGATCCCACAAATCA 
CCCCCCACTGAAGACAATGC CCATAACAAGGTAAC C T 


SEQID 
NO. 235 


KCP_1540 
0 


TTATGCAAGTTGTTAACATTAGAGGAGACTGAGTGAAAGGTATG 
CATGAACTCTCTGTACTAATTTTGTAAATTTTCTGTAAGTCTAA 
AATTATTCATAATATGCAAAAATTAAACAAAAAATAAAATAAAA 
TAAGCACATGGAATGAGACTGTCCCCTGGGTCTCTGTAGAAACC 
AGGTCAAACATCCCAAATGCTCTTTTACCCCCATTCTGAGTTGG 
GCCAGAATGGTCAGAATAATGGTTCCCAATGTACCTTGATAAAC 


SEQID 
NO. 236 



BNSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-173- 



PCT/US2003/034681 





ACGGAAACTCTCAGGACCGAGTCCTAAGGTTCTCTGATTCAATA 

GGTTTGGAGTGGACTTGAGAACTGATCTTTTTAATAAGGGCCTC 

AGTCTGTGGAACTATTGGCCTCATGTGCCCTGTGGATAATCTTG 

GCTGTTGGTTCATTTTTCTTAACTGAAAACAGTGGCAGAAACTA 

TGGGGATTTTTAAATCTCTAGGCTAGAACATTAACTTTTTAAAA 

ATTCAGAATAGTATTTTATTTGCCTCAAGCCTGTGAATGGGGAT 

CCCACAAATCACCCCCCACTGAAGACAATGCCCATAACAAGGTA 

ACCTACCCATGAGCTTCTGAGGGATTTAGGAATTGTCTACCATC 

TCCTCTCTAAGAAGGGCTCCCACAATATATCCCCTTCTGCTTGC 

TTCTAACTCCCTATCACCTGCTAAAGAAGGACCTCACCTTTTAA 

TCACTTTCATTGCCAAGGGGCACAAGGAGCCCCAAACTCTGTCA 

CCTAGGAAGAGCTTGACCTCATGGTTTCCACACTGTGTGCTTTT 

ATGTCCCTGCTCCAGGAGATGATGGACATTGTCAAAGCCATCTA 

TGACATGATGGGGAAATACACATATCCTGTGCTCAAAGAGGACA 

CTCCAAGGCAGCATGTGGACGTCTTCTTCCAGGTAAGTGCACAC 

ACCCTGCACATGAGCTGTAAGCCCAGCCTAGATCAAGTCAACCC 

ACGAGCATCTGAGCAAATGATTTGTGTCCAAC [C/T] CTGTACT 

AAGCATGGTTGGTAACAGAAAAGAATTATAAGATACATTGTCCT 

CAAGAAACAGATGATCTCCTTAAGCTGCAAGTGTACATGACAGA 

AGAGAACAAGAAAGTATATTATTAAACGCTAGTGGTATAGTATG 

AACTCTAAATCCATAAAAATTTGGGGATCAGGGTAAACACGAAA 

GACTTCATTAATTACAACTGTGGAGGTGTTAAGCATTTGTGTCT 

GGGAAGTAAGGGGAAATAAGATTGGAAACTAGGATAGGGCCAGA 

TTATGAGACCTTTAAATGGAAGAGTTTGGCCTTGCTCTGGTACA 

GGATGGGCAGCTAGTGCTGATCCTTGACTAAGGGAGTGGTATAA 

TCATTGGGGCATTTTAGGAAAAAATTAATCTAGCGGTGGAGTAT 

CAGAGAATATCAAGAGTTCACTCTAGTTCAACCTCCCACTTTGC 

AGATGGGAAAAGAGAGTCCTCTCTGGCCTTGTGCAAGTTTGTAC 

AGCAAGTAACAGGCCAGAATCAGAACCTCTTTTGCCCAGTGTTC 

TGCCAGATGGACAGGGTAGCAGGGAGTCTACAGAAGAAGCAGAA 

TAAGCCAGCAGTGAGGTGATGAGTGTCCAGAGCAAGTCTTTTGA 

TTTAAGGAAGCTCATGGGGCTCAAAGTGTTGTAATCAGGACCTA 

ATTGGAGTTGTCTGGCCAGTGAAAGACAA.CTCTCATTCTCAGGG 

CAAAGTTGGTTAATGAAATGAATGAAATGAGCTCCAGCTCGTTA 

CTCTGAGCTCCAGCAAGAAAGCAGGGGAGTAAGCTTTGGAATGG 

AGATCACCAGATTCTGTAAAGTGCTTTCTGTTATGTCTTTCAGA 

AAATGGACAAAAATAAAGATGGCATCGTAACTTTAGATGAATTT 

CTTGAATCATGTCAGGAGGTAAGGAGAGATCTCAGGGCACAATA 

ACTCTACATCTGGGAAAGGAAACCTGGGGCCTGGGGACCTGCAG 

AAGGAAGGTGATGAGAAACCTGCAC 




KCP 2385 
91 


TCTAACTCCCTATCACCTGCTAAAGAAGGACCTCACCTTTTAAT 
CACTTTC ATTGCCAAGGGG CACAAGGAGCCC CAAACTCTGTCAC 
CTAGGAAGAGCTTGACCTCATGGTTTCCACACTGTGTGCTTTTA 
TGTCC CTGCTC CAGGAG ATGATGGACATTGTCAAAGCCATCTAT 
GACATGATGGGGAAATACACATATCCTGTGCTCAAAGAGGACAC 
TCCAAGGCAGCATGTGGACGTCTTCTTCCAGGTAAGTGCACACA 
CCCTGCACATGAGCTGTAAGCCCAGCCTAGATCAAGTCAACCCA 
CGAG CATCTGAGCAAATGATTTGTGTCCAAC C CTGTACTAAGC A 
TGGTTGGTAACAGAAAAGAATTATAAGATACATTGTCCTCAAGA 
AACAGATGATCTCCTTAAGCTGCAAGTGTACATGACAGAAGAGA 
ACAAGAAAGTATATTATTAAACGCTAGTGGTATAGTATGAACTC 
TAAAT C CAT AAAAATT [C/T] GGGGATCAGGGTAAACACGAAAG 
ACTTCATTAATTACAACTGTGGAGGTGTTAAGCATTTGTGTCTG 
GGAAGTAAGGGGAAATAAGATTGGAAACTAGGATAGGGCCAGAT 
TATGAGACCTTTAAATGGAAGAGTTTGGCCTTGCTCTGGTACAG 
G ATGGGC AGC TAGTGCTGATC CTTGACTAAGGGAGTGGTATAAT 
CATTGGGGCATTTTAGGAAT^AAATTAATCTAGCGGTGGAGTATC 


SEQID 
NO. 237 



INSDOCID: <WO 2004041 193A2_L> 



WO 2004/041193 



-174- 



PCT/US2003/034681 





AGAGAATATCAAGAGTTCACTCTAGTTCAACCTCCCACTTTGCA 
GATGGGAAAAGAGAGTCCTCTCTGGCCTTGTGCAAGTTTGTACA 
GCAAGTAACAGGCCAGAATCAGAACCTCTTTTGCCCAGTGTTCT 
GCCAGATGGACAGGGTAGCAGGGAGTCTACAGAAGAAGCAGAAT 
AAGCCAGCAGTGAGGTGATGAGTGTCCAGAGCAAGTCTTTTGAT 
TTAAGGAAGCTCATGGGGCTCAAAGTGTTGTAATCAG 




KCP 1615 
2 


GGAAGAGCTTGACCTCATGGTTTCCACACTGTGTGCTTTTATGT 

C CCTGCTC CAG G AGATGATGGACATTGTCAAAGC CATCTATGAC 

ATGATGGGGAAATACACATATCCTGTGCTCAAAGAGGACACTCC 

AAGGCAGCATGTGGACGTCTTCTTCCAGGTAAGTGCACACACCC 

TGCACATGAGCTGTAAGCCCAGCCTAGATCAAGTCAACCCACGA 

GCATCTGAGCAAATGATTTGTGTCCAACCCTGTACTAAGCATGG 

TTGGTAACAGAAAAGAATTATAAGATACATTGTCCTCAAGAAAC 

AGATGATCTCCTTAAGCTGCAAGTGTACATGACAGAAGAGAACA 

AGAAAGTATATTATTAAACGCTAGTGGTATAGTATGAACTCTAA 

ATCCATAAAAATTTGGGGATCAGGGTAAACACGAAAGACTTCAT 

TAATTACAACTGTGGAGGTGTTAAGCATTTGTGTCTGGGAAGTA 

AGGGGAAATAAGATTGGAAACTAGGATAGGGCCAGATTATGAGA 

CCTTTAAATGGAAGAGTTTGGCCTTGCTCTGGTACAGGATGGGC 

AGCTAGTG CTGATC CTTGACTAAGGGAGTGGTATAATCATTGGG 

GCATTTTAGGAAAAAATTAATCTAGCGGTGGAGTATCAGAGAAT 

ATCAAGAGTTCACTCTAGTTCAACCTCCCACTTTGCAGATGGGA 

AAAGAGAGTCCTCTCTGGCCTTGTGCAAGTTTGTACAGCAAGTA 

ACAGGCCAGAATCAGAACCTCTTTTGCCCAGTGTTCTGCCAGAT . 

GGACAGGGTAGCAGGGAGTCTACAGAAGAAGCAGAATAAGCCAG 

CAGTGAGGTGATGAGTGTCCAGAGCAAGTCTTTTGATTTAAGGA 

AGCTCATGGGGCTCAAAGTGTTGTAATCAGGACCTAATTGGAGT 

TGTCTGGCCAGTGAAAGACAACTCTCATTCTCAGGGCAAAGTTG 

GTTAATGAAATGAATGAAATGAGCTC CAGCTC [A/G] TTACTCT 

GAGCTCCAGCAAGAAAGCAGGGGAGTAAGCTTTGGAATGGAGAT 

CACCAGATTCTGTAAAGTGCTTTCTGTTATGTCTTTCAGAAAAT 

GGACAAAAATAAAGATGGCATCGTAACTTTAGATGAATTTCTTG 

AATCATGTCAGGAGGTAAGGAGAGATCTCAGGGCACAATAACTC 

TACATCTGGGAAAGGAAACCTGGGGCCTGGGGACCTGCAGAAGG 

AAGGTGATGAGAAACCTGCACATAC CTGCAACCC CTCCCATCAG 

AGCCAACAACACCAGCAACAACTGTGAAGTCCACAGTTCCACTC. 

CTCAACCTGACCTGCAGTTGGTCTTGGCTAAGCACAAGACTGAA 

CAGAGAGCCTAAGTAGGGGTCTGGGGGCATGTGAAAACTCAGAG 

GGGGTCTCTGTGAAAATAGACTTCCCGAGAGGGCAACACCATTA 

TTTTTTAGCCTGCCTCTGGCTTGATGACCCATTTCCCAGACTAC 

AAGGAAGCAGCTGGGGGGAAAAAAACCTACAATTGTGTGATTCT 

CAAACCACAGTGTGCATAAAAATTGCCTGGAATGATTCTGAAAA 

TG CATATTTCCAGGC CTCAATC CCAGAGACTCTAGATCTGGGTC 

ACTTTAACACAAATGTCCTGGACCAATGCTTCTAACACTTTAAT 

GTGTGAT^ACAATATCCTTGATGATTTTGTTAAAATGCAGATTCT 

AATTCCATAGGTCTGGGGTAGGGCCTGAGATGTTACTTTTCTCA 

CATTCTCCCCAGTCACACTGGTGATGCTGATCCTGGGAACACAA 

CTTTCATTAAGTCTAACCAATAGACCAGCCCCAGAGTCCACCAG 

AGACTGAACTGGAAATAATTGCTTCATCTACTTTTGAGAAATCC 

-mj. x ± vj x^-iv_v^^v_v_/a.v— rt. jl ihi 1 1 1 AIjAAAivjITCAGAGXTACTCTG i 

AG CTC CAG C CAAGAAGAATAGCAAATGTAAGAAAGCCGGGGAGA 

AGTTCCTAGCAGATACTG AGCCCC C 


SEQED 
NO. 238 


KCP 18 06 
9 


TTGAAAGAGAGCGCTTTGGGGGGTTTTCTTACTGTATGTCTCTA 
TTGCATGTTCTGTATTTTACATTTTTCTATTATTTCTTCTCTGA 
GGTATAGTATTGAATGTAGAAAAATCCTCAAATGTTCGGTATTA 
AGCAATACAC TTCTAATTCATGGTTC AGAGAAGAAAATATCTC G 
AATAAAAATAAAATAAAAATATGACTTATCAAAATTTGTAGGAT 


SEQID 
NO. 239 



3NSDOCID: <WO 200404 1 1 93A2_L> 



WO 2004/041193 



PCT/US2003/034681 



-175- 





CTAAAGCAGTATTCCAGGAATGCAAGGTTGGTTTAACATTCAAT 
AATTGGTCAGTGTAATTAATCACATTAATAGAATAAAAAGAGAA 
AAAATATAATCATTTCAGTGGATGTAATTGTTCAGAGCTTCTTA 
AAAGAAGCAACTCACTATTTTACTAGATGATTTGTTTCTTCTGA 
ATTCCTCTTTAAGGCTACAGGTGGTGCTTCTTACTTTGAACTGA 
TCACTTTCTAGGTCCCCACCCTTACTTCTTGTTTTTCATACCCT 
TGTAGAGTTTTCTCCA [C/T] ATAGGAAAC CCATGC TTGACATT 
TGCTCACCAGAGTTACAGAGCTCTCAGGGAGGAGACTCAGAGTT 
CTAACCCTCTTGCCCTCCTTTTTTCCCAGGACGACAACATCATG 
AGGTCTCTCCAGCTGTTTCAAAATGTCATGTAACTGGTGACACT 
CAGCCATTCAGCTCTCAGAGACATTGTACTAAACAACCACCTTA 
ACACCCTGATCTGCCCTTGTTCTGATTTTACACACCAACTCTTG 
GGACAGAAACACCTTTTACACTTTGGAAGAATTCTCTGCTGAAG 
ACTTTCTATGGAACCCAGCATCATGTGGCTCAGTCTCTGATTGC 
CAACTCTTCCTCTTTCTTCTTCTTGAGAGAGACAAGATGAAATT 
TGAGTTTGTTTTGGAAGCATGCTCATCTCCTCACACTGCTGCCC 
TATGGAAGGTCCCTCTGCTTAAGCTTAAACAGTAGTGCACAAAA 
TATGCTGCTTACGTGCCCCCAGCCCACTGCCTCCAAG 




KCP_2415 
27 


ACTTTGAACTGATCACTTTCTAGGTCCCCACCCTTACTTCTTGT 
TTTTCATAC CCTTGTAGAGTTTTCTC C ATATAGGAAACCCATGC 
TTGACATTTGCTCACCAGAGTTACAGAGCTCTCAGGGAGGAGAC 
TCAGAGTTCTAACCCTCTTGCCCTCCTTTTTTCCCAGGACGACA 
ACATCATGAGGTCTCTCCAGCTGTTTCAAAATGTCATGTAACTG 
GTGACACTCAGCCATTCAGCTCTCAGAGACATTGTACTAAACAA 
CCACCTTAACACCCTGATCTGCCCTTGTTCTGATTTTACACACC 
AACTCTTGGGACAGAAACACCTTTTACACTTTGGAAGAATTCTC 
TGCTGAAGACTTTCTATGGAACCCAGCATCATGTGGCTCAGTCT 
CTGATTGCCAACTCTTCCTCTTTCTTCTTCTTGAGAGAGACAAG 
ATGAAATTTGAGTTTGTTTTGGAAGCATGCTCATCTCCTCACAC 
TGCTGCCCTATGGAAG [G/T] TCCCTCTGCTTAAGCTTAAACAG 
TAGTGCACAAAATATGCTGCTTACGTGCCCCCAGCCCACTGCCT 
CCAAGTCAGGCAGACCTTGGTGAATCTGGAAGCAAGAGGACCTG 
AGCCAGATGCACACCATCTCTGATGGCCTCCCAAACCAATGTGC 
CTGTTTCTCTTCCTTTGGTGGGAAGAATGAGAGTTATCCAGAAC 
AATTAGGATCTGTCATGACCAGATTGGGAGAGCCAGCACCTAAC 
ATATGTGGGATAGGACTGAATTATTAAGCATGATATTGTCTGAT 
GACCCAAACTGCCCATGTCATTTGTTTCCAGAAACGAGGACCAA 
TAATTCTCTCACACTGGCATTTGTGCTGGTAGTACAAGTCCTTT 
AATATGTCCAGGAAGGGAGCCATTGCCCAGTGGTCCATATCTCC 
ACCACATCCCCTGCTTGAGCCCAGCGCTGCATGTCCCTCCCAAG 
AAGTCCAGAATGCCTGCAAATTGCTGTAATTTTATAC 


SEQID 
NO. 240 


KCP 2418 
04 ! 


CTGATCTGCCCTTGTTCTGATTTTACACACCAACTCTTGGGACA 
GAAACACCTTTTACACTTTGGAAGAATTCTCTGCTGAAGACTTT 
CTATGGAACCCAGCATCATGTGGCTCAGTCTCTGATTGCCAACT 
CTTCCTCTTTCTTCTTCTTGAGAGAGACAAGATGAAATTTGAGT 
TTGTTTTGGAAGCATGCTCATCTCCTCACACTGCTGCCCTATGG 
AAGGTCCCTCTGCTTAAGCTTAAACAGTAGTGCACAAAATATGC 
TGCTTACGTGCCCCCAGCCCACTGCCTCCAAGTCAGGCAGACCT 
TGGTGAATCTGGAAGCAAGAGGACCTGAGCCAGATGCACACCAT 
CTCTGATGGCCTCCCAAACCAATGTGCCTGTTTCTCTTCCTTTG 
GTGGGAAGAATGAGAGTTATCCAGAACAATTAGGATCTGTCATG 
ACCAGATTGGGAGAGCCAGCACCTAACATATGTGGGATAGGACT 
GAATTATTAAGCATGA [C/T] ATTGTCTGATGACCCAAACTGCC 
CATGTCATTTGTTTCCAGAAACGAGGACCAATAATTCTCTCACA 
CTGGCATTTGTGCTGGTAGTACAAGTCCTTTAATATGTCCAGGA 
AGGGAGCCATTGCCCAGTGGTCCATATCTCCACCACATCCCCTG 
CTTGAGCCCAGCGCTGCATGTCCCTCCCAAGAAGTCCAGAATGC 


SEQID 
NO. 241 



5NSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 PCT/US2003/034681 

-176- 





CTGCAAATTGCTGTAATTTTATACCATGTTCTAACCAATAAACA 
GAACTATTTCTTACACTCTCAATCACTTCTTCATGACTCCGTTA 
GGTAAGAGAGGTAAGCTGTGAAAAGGGAAGGCTAGTCCATTCAT 
TTGACACCCAATTATTAGTGCAGTTGTCCCTCCATATGTGTGAA 
GGATCAGTCCCAGGACTCTCCATACCAAAATCTGCAGATACTCA 
AGTCCCACAGCTAGCCCTGAGGGACTCGTGTTTTCAGAAAATTT 
GGCCTCCATATATGCAGGTTTCACATCCTATAAATAC 




KCP_13 24 


CCCAAGCTCAACCATTCCAATGCCATCTCCTCTGGTTCCAGATA 
AGATTGAAGATGAGCTGG AGATGAC CATGGTTTGC CATCGGCC C 
GAGGGACTGGAGCAGCTCGAGGCCCAGACCAACTTCACCAAGAG 
GGAGCTGCAGGTCCTTTATCGAGGCTTCAAAAATGTAAGACCCG 
TGCACGCTCTGAAGGCCTGGGGGG 


SEQID 
NO. 242 


KCP_152 0 
4 


TTGTCTACCATCTCCTCTCTAAGAAGGGCTCCCACAATATATCC 
CCTTCTGCTTGCTTCTAACTCCCTATCACCTGCTAAAGAAGGAC 
GTCACCTTTTAATCACTTTCATTGCCAAGGGGCACAAGGAGCCC 
CAAACTCTGTCACCTAGGAAGAGCTTGACCTCATGGTTTCCACA 
CTGTGTGCTTTTATGTCCCTGCTC 


SEQID 
NO. 243 


KCP_4957 


ACCCTCAATACAGACTGTTCCTACAGTCCACGTCCTCAGCCACT 
AGACCATACGGCCACTGGGATGATAGACAGACCACTGCAGCCAT 
GGATAAGGCAAAAACAGGGCTGGCTGTGTTGATCTGTGTCTCTC 
AGAGCTCCATTCTTCCTCAAGGGGGCACCTTGCAAAAAAAAACA 
AAAAAATGGGGCAGGGTAGGGAAC 


SEQID 
NO. 244 


KCP_5 011 


GCCACTGGGATGATAGACAGACCACTGCAGCCATGGATAAGGCA 
AAAACAGGGCTGGCTGTGTTGATCTGTGTCTCTCAGAGCTCCAT 
TCTTCCTCAAGGGC3GGACCTTGCAAAAAAAAACAAAAAAATGGG 
GCAGGGTAGGGAACTGAAGGCAGGAGCTCTTCACAGAGCATAGC 
CACATCCTCCAGGCAGACAAGAGG 


SEQID 1 
NO. 245 


KCP_5 051 


GGCAAAAACAGGGGTGGCTGTGTTGATCTGTGTCTCTCAGAGCT 
CCATTCTTCCTCAAGGGGGCACCTTGCAAAAAAAAACAAAAAAA 
TGGGGCAGGGTAGGGAACTGAAGGCAGGAGCTCTTCACAGAGCA 
TAGCCACATCCTCCAGGCAGACAAGAGGACGCAGGAGGCACCAT 
TCTGTGAGAGTATCACAGTCTGAC [C/T] CAAAGACACAGCTTC 

ACACTGTCTGATGGCTTGATGGTTAATGTCACTCTGCCTTTTCC 
C CTTCTCAGGACTTTGTAACCG CTCTGTCGATTTTATTGAGAGG 
AACTGTCCACGAGAAACTAAGGTGGACATTTAATTTGTATGACA 

TCAACAAGGACGGATACATAAACAAAGAGGTAAGTGAGCTGGGG 
CCAGGGGTGT 


SEQID 
NO. 246 


KCP_52 02 


GACAAGAGGACGCAGGAGGCACCATTCTGTGAGAGTATCACAGT 
CTGACCCAAAGACACAGCTTCACACTGTCTGATGGCTTGATGGT 
TAATGTCACTCTGCCTTTTCCCCTTCTCAGGACTTTGTAACCGC 
TCTGTCGATTTTATTGAGAGGAACTGTCCACGAGAAACTAAGGT 
GGACATTTAATTTGTATGAGATCA [A/C] CAAGGACGGATACAT 
AAACAAAGAGGTAAGTGAGCTGGGGCCAGGGGTGTGAGAGGGCT 
C C AGTGAAGGTAACTAACC CAACAG AAAACAGCCC C AGGCATG A 
GGATAGCACTGTCTGAATGAGGCAGGCTCTGCTTTGGGGCTAAC 

AGAGCTGGTCCCTGGCAAAATAAAGAAGGCCTCCCTCATTGCCC 
TACCCTGCCC 


SEQID 1 
NO. 247 


KCP_ela 
249924 


CCACCAGGGTCCCTTCCAACTCACGGAGCCTATGGTACTGAATG 

GCAGCCAGGTTTTTTATGGAGCAATAGCTGGACTTCACATTTGC 
ATAATGCCTTGClAGTTTraPTf3T , Ta an^nm-n ^mz-i/T* m mmm « ~_ 
vjv— v- j. j. in ^-^t.v— iui ^-"AoACaTACXGGATTGTATTC! 

TAATTATATGAATCTCGGTCATTCCTTTATGACATTTCTGAGGA 
ATACTATCTCAATCAAGAAAAGCCCTAATTGCACTCCTCTCCTA 
TCCCGGTGAGAGAGCACAGACTCGTGCCTGCTCCGCAGGGGTGG 
AGGCTGGAATTCAGTAGTCTGAGTCGGGGATGCCTGGAGCAGGA 
GGTGGTCAGGGGCATTGTCCTTTCCAAGTCAGGAAGGCAGACAG 
CACCTGCTGTTGGTGCCAAGGTTACTGGACAGGCTGCGAGGGCT 
CTGTCTGTCTGTCCGATGTTCACAGGCCAGCTCCCCGGAGGCTC 


SEQID 
NO. 248 



WSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



-177- 



PCT/US2003/034681 





AGCACTCAGCCCAGCTTCTCCGAGATGCAAACCAGGCCACTCTG 

AGGCTGCCTACAAACTTTCTGCTGAGTGCCGACAGCTGCTTCCT 

GCTCTGCGGGGAGTTCTTCCAGATCCTGATCAAGGCACAGAGAA 

TTGATCTATCAGATTAACCAGGAAGGAAAGAGTGGGAGAGCGAG 

TGTGGGAGGCTGTGGGGCTGAGTGTTTTCTGCGTAGCAGTCCCC 

TCCCTTCTGACTTGAGTATTAATTGCTACATTACCGCTGCCATG 

TAAGAAAGACAGTCAGCAAAGCCTGGGAGAGCTCCAGCTCCTCC 

CTCCCTGCTCTGCTCAACTTCACTCTCCTCCTCGGTTCCCTTGG 

AGTACCTTGTGCCCCGGCAGTGCTGTCCCGGCCCTGGCATCCTG 

AGGTCCTCCCGTGGTGAGGACTTAAGTGGACA [C/G] CAGGAGT 

GGGTGGAGAGAGGGAGGGAGAGTTTGCCCTGCAGGCTCTCTGGA 

TGCAGAAGCCAGACTCGCTGCAGAGGCAGCTGTGCTGTTCCCGG 

AGCCTGGCTTCAGGGGTGCATCCGTCACTCAGGGTTCATTCACC 

CAGGCAGGCTCCAAGTTCCTGGGGTGCACAAGGTGGGCACTGTC 

CCTTCTGGGTGCTGACAGCAGAGCCTGGCTCCCCTCCGCCACCA 

TGAGCGGCTGCTCCAAAAGATGCAAGCTTGGGTTCGTGAAATTT 

G C CCAGACCATCTTTAAGCTCATCACTGGGACCCTCAG CAAAGG 

TATGGAAACTGGCCTTGACCCTTGCTTTCTGTCTTGATATGGCC 

TGGCTGGTCGCATTGCCTCGGTGTGGTGAGCGTGACCATTCTGG 

TGCACCCAGGTCTTGGAAAAAGCTGGGGAAATTGGTGGCTGGGA 

TTCGAGGTTGCTGACAACCTGCGTCCTGGCTTTGAGTAGGCGGG 

CACCCAGCCAGGGAACTCAGCTGGCTGTAATTGCCTGGAACTTT 

GGAAATGGAGTTGGTGGTGTGTGGCTGATACGTTATGGGCGGGC 

AGAGGGATAGAACCCTTTCCAGAGCATTGGAAGTGGCTTAGCGT 

GACTGGAGTTTCAAGAAGTTATCCATGGAAGGTTGTATTTTGTT 

GATAAAAGAGAGATTTGATGCAGTGGGTTGTGAGTAATTCTGCA 

GAACAGAGACGCTTGAGGGGGCCAGTGGGAGGTGGTGATGGGCC 

GGCATCTGCTTTGCCCTGGTGGCTTCAGAAACCGGATCAGCTCT 

GCACCTCAAGTGCCAAGAGCCTCCTCTCATAGGGTTCCAGCGTC 

TCGTGCTTCTGGGGCTTCATTCATCGTTCTGCTTTCTTGGATCC 

CTGTCCCTCCACATTTCATGCCTA 




KCP_ela_ 
250027 

• 


CAAGGCACAGAGAATTGATCTATCAGATTAACCAGGAAGGAAAG 
AGTGGGAGAGCGAGTGTGGGAGGCTGTGGGGCTGAGTGTTTTCT 
GCGTAGCAGTCCCCTCCCTTCTGACTTGAGTATTAATTGCTACA 
TTACCGCTGCCATGTAAGAAAGACAGTCAGCAAAGCCTGGGAGA 
GCTCCAGCTCCTCCCTCCCTGCTCTGCTCAACTTCACTCTCCTC 
CTCGGTTCCCTTGGAGTACCTTGTGCCCCGGCAGTGCTGTCCCG 
GCCCTGGCATCCTGAGGTCCTCCCGTGGTGAGGACTTAAGTGGA 
CAGCAGGAGTGGGTGGAGAGAGGGAGGGAGAGTTTGCCCTGCAG 
GCTCTCTGGATGCAGAAGCCAGACTCGCTGCAGAGGCAGCTGTG 
CTGTTC CCGGAGCCTGG [C/T] TTCAGGGGTGCATCCGTCACTC 
AGGGTTCATTCACCCAGGCAGGCTCCAAGTTCCTGGGGTGCACA 
AGGTGGGCACTGTCCCTTCTGGGTGCTGACAGCAGAGCCTGGCT 
CCCCTCCGCCACCATGAGCGGCTGCTCCAAAAGATGCAAGCTTG 
GGTTCGTGAAATTTGCCCAGACCATCTTTAAGCTCATCACTGGG 
ACCCTCAGCAAAGGTATGGAAACTGGCCTTGACCCTTGCTTTCT 
GTCTTGATATGGCCTGGCTGGTCGCATTGCCTCGGTGTGGTGAG 
CGTGACCATTCTGGTGCACCCAGGTGTTGGAAAAAGCTGGGGAA 
ATTGGTGGCTGGGATTCGAGGTTGCTGACAACCTGCGTCCTGGC 
TTTGAGTAGGCGGGCACCCAGCCAGGGAACTCAGCTGGCTGTAA 


SEQID 
NO. 249 


KCP_ela 
250049 


ACAGAGAATTGATCTATCAGATTAACCAGGAAGGAAAGAGTGGG 
AGAGCGAGTGTGGGAGGCTGTGGGGCTGAGTGTTTTCTGCGTAG 
CAGTCCCCTC C CTTCTGACTTGAGTATTAA.TTG CTACATTACC G 
CTGCCATGTAAGAAAGACAGTCAGCAAAGCCTGGGAGAGCTCCA 
GCTCCTCCCTCCCTGCTCTGCTCAACTTCACTCTCCTCCTCGGT 
TCCCTTGGAGTACCTTGTGCCCCGGCAGTGCTGTCCCGGCCCTG 
GCATCCTGAGGTCCTCCCGTGGTGAGGACTTAAGTGGACAGCAG 


SEQED 
NO. 250 



JNSDOCID: <WO 2004041 1 93A2 J_> 



WO 2004/041193 



-178- 



PCT/US2003/034681 



KCP_UTR1 
382206 



KCP_UTR1 
382272 



KCP 3UTR 



GAGTGGGTGGAGAGAGGGAGGGAGAGTTTGCCCTGCAGGCTCTC 
TGGATGCAGAAGCCAGACTCGCTGCAGAGGCAGCTGTGCTGTTC 
CCGGAGCCTGGCTTCAGGGGTGCATCCGTCACT [A/C] AGGGTT 
CATTCACCCAGGCAGGCTCGAAGTTCCTGGGGTGCACAAGGTGG 
GCACTGTCCCTTCTGGGTGCTGACAGCAGAGCCTGGCTCCCCTC 
CGCCACCATGAGCGGCTGCTCCAAAAGATGCAAGCTTGGGTTCG 
TGAAATTTGCCCAGACCATCTTTAAGCTCATCACTGGGACCCTC 
AGCAAAGGTATGGAAACTGGCCTTGACCCTTGCTTTCTGTCTTG 
ATATGGCCTGGCTGGTCGCATTGCCTCGGTGTGGTGAGCGTGAC 
CATTCTGGTGCACCCAGGTCTTGGAAAAAGCTGGGGAAATTGGT 
GGCTGGGATTCGAGGTTGCTGACAACCTGCGTCCTGGCTTTGAG 
TAGGCGGGCACCCAGCCAGGGAACTCAGCTGGCTGTAATTGCCT 
GGAACTTTGGAAATGGAGTTGGTG 



TGGCCCACCTTCAGGGTCATGAGGATTCATAAACCCTATTCTGC 

GAAGTGCCTC CAGGAATCATC AAGGGAGCTAGGG CAGCTCTGAG 

TCTCCACCAGGCCCACCCTCCGCCTCTCAGGGCTGAGCTTCACT 

TCCCTTCCCAAAGGGGCCAGGGAGAGGGGCTGCTGATGACATGA 

TCTCAGAGGAAGGCCAAGGCCTCCAGGCTGCCTCTGGGCCTGGC 

ACAGGAAGGAGGAGGAGAAAATAGGGAGCCCAAGGAAAGATCAA 

CCCAGCCCAGCCCAAGGACCCCCAGCCCCAGCCCCAGCCCCAGC 

TGGGCTCAAACTAATTGAAAACAGACTGGAAAAGGCTGCTTTTG 

CCCTTCCTCTAGACTCAGCATCATCAAGACTGGAGGGACAGAGC 

ATTTGAATCATCAGACGCTGGGCCAGA [C/T] GTCACCCCACGC 

GTTTTCTCATTTTATCGTCCTAAGAAGCCCAGAAGGTGCGTAAA 

ATGGCCTGTCCCAAACAGATGAGGACATTACCTTTCTCCTCTTC 

CTCCTCCTCCTTCTTCTTCTTCTTCTTTTTGCTTCATTTTTCTT 

TCATTTTTTCCCCCAGATGTTGCATTTCAGAGAGGCTGAGCGTG 

TTGACTAAGGTCACACAGCTACAAACATCAGGGACCTGCGAAAA 

AGCTCTGTTCCCTGGTGACAGGTGTTCTGTGATCCTAACACAGC 

CGGAGGTGGGGACAACGTCCTTGCAGTAACAAAGGCCCTGTTGC 

TCAACTCAGTGGACATCAGGCCCTGTTTTCATTCATTAGCAGGT 

CAGGGATTC CAGTGTCACCTGTGCCATGTATTC CAG CTG ATCTA 

CCTGCAAGCCTCTACTCCCCATTTTCCCAGCAGCAGCCGCAGAC 
ACCACCCAACTGG 



SEQID 
NO. 251 



GGGTCATG AGGATTCATAAACCCTATTCTGCGAAGTGC CT CCAG 
GAATCATCAAGGGAGCTAGGGCAGCTCTGAGTCTCCACCAGGCC 
CACCCTCCGCCTCTGAGGGCTGAGCTTCACTTCCCTTCCCAAAG 
GGGCCAGGGAGAGGGGCTGCTGATGACATGATCTCAGAGGAAGG 
CCAAGGCCTCCAGGCTGCCTCTGGGCCTGGCACAGGAAGGAGGA 
GGAGAAAATAGGGAGCCCAAGGAAAGATCAACCCAGCCCAGCC C 
AAGGACCCCCAGCCCCAGCCCCAGCCCCAGCTGGGCTCAAACTA 
ATTGAAAACAGACTGGAAAAGGCTGCTTTTGCCCTTCCTCTAGA 
CTCAGCATCATCAAGACTGGAGGGACAGAGCATTTGAATCATCA 
GACGCTGGGCCAGACGTCACCCCACGCGTTTTCTCATTTTATCG 
TCCTAAGAAG CC CAGAAGGTGCGTAAAATGGC CTGT [A/C] CCA 

AACAGATGAGGACATTACCTTTCTCCTCTTCCTCCTCCTCCTTC 
TTCTTCTTCTTCTTTTTGCTTCATTTTTCTTTCATTTTTTCCCC 
CAGATGTTGCATTTCAGAGAGGCTGAGCGTGTTGACTAAGGTCA 
CACAGCTACAAACATCAGGGACCTGCGAAAAAGCTCTGTTCCCT 
GGTGACAGGTGTTCTGTGATCCTAACACAGCCGGAGGTGGGGAC 
AACGTCCTTGCAGTAACAAAGGCCCTGTTGCTCAACTCAGTGGA 
CATCAGGCCCTGTTTTCATTCATTAGCAGGTCAGGGATTCCAGT 
GTCACCTGTGCCATGTATTCCAGCTGATCTACCTGCAAGCCTCT 
ACTCCCCATTTTCCCAGCAGCAGCCGCAGACACCACCCAACTGG 

CAGAAATTTCAAACAAGGGGTTCTGCCTTGCACTCCGGTGCAAG 
GGTTGGGCACGTGGACTCACAT 



CACAAAACAAATCCGGGACTTTAAGCCTGATCTGCTTGACCTGA 



SEQID 
NO. 252 



SEQID 



WO 2004/041193 



-179- 



PCT/US2003/034681 



2_3 95 068 


AACTCATATCTACTTCCCTGCCCTCTGAAGA.TCTATATGTCCTA 
TGTCATCACTTCACTGTTCACACAAGGTGATACCTGGCTTCTCC 
AAGCACCTGCTACCCTGAACTTACTGCACCACTCTTTCCTTCCT 
AGCCTGAATGCAATTTGCAATGAGGAGATGATTTGATTTTCTTC 
AGCCCTAGACCTCCAGCTTCCTGAGAGCAGGTACTCTTGCCTCT 
TCTTGCTCATTATTGATCCATATATTTAGAATAGCGCCTGGCAG 
GTAGATGGTGCTTAATAAATATTCATTGAATAAATGAATGAATG 
AATGATCCAATGAGCCCCAAAGCAAATAACAATAAAGGACATTT 
GCAGAGTGCTCTACAGAGAGACAAGTGCTTTCCCTTT [A/G] CT 

TTATCTTACCCCATTCTCACAACAATCCCCTGACATGATTGGGT 
TCATGTTTCACAGATGAGGAGGCTAACGGP r ARRTOTa r A t a pp 

AGGGGACATGGGACTGGGTTCATATGAGCTCAGGGGTAAATGAT 

GACACCCTTTCCCCTGCCCTGAAGGATCTCAGTTTGAGTATTTG 

TAGCACACTTAGGATGTTCTGGGCCAGGCTGAGTGGCGGTGGAT 

GGGGGCGGTGGAGGTGGGGTATGCAAAGCAGGAAACTCGGCCTT 

TGCTTTCTAAAAGCTCCCAGTCTATTTGAGGCCAGACTTATGCA 

TGCAGAACATTTGGGAAATGGTAGAAGACAGCAGCAAGCATAGT 

GCTGAATTGCACATAATCAGGTGCCAACTGCATTCCCTTCCTTA 
ACTAATCT 


NO. 253 


KCP_3UTR 
3_398480 


AACT'X-'rCTCCTCAGCAAAGAGCTCTCCTCTGTTCCCTGAATCCT 
GGATATCCCACTGGGTCCTCTAGTGACCCCAAGCTTCAGCCTCG 
CATGCCCTCTTCTCGAACAGAGAAGGCAGGAGGGAAGCAGGGAC 
CAGCC CCTGCTCCATCTTCCAGGATTCCAGGCCTCCCTGGCCTG 
GACAAGCCCTGAGCTGGCAGTTAGGAGAGCAGAGGTTGTGAATC 
TGGTGGGACCCCCAGCAGGTCTTTCTGGCTCAGTGCCCTCATCT 
GTGAGCAGGGGTTCCCCAGGAGACCACGACAGAGGCCTGGAACC 
CAAGTTCTAATCCCACATCCTGGCTGGGCAACTTCAGGCAAATT 
TCTAACACAAGGTAAGCCTCAATTTCTCTCTGGGGTAATGATCA 
GGCACCTGCTTAATTCACAGGGGTTTGGTGGGCATCA [C/T] GT 
GGACAATGTGGTTGCACAGCAGTGGGCAATGCAAAGGAAAGGAA 
GTATGTTAGTAAGTGCCCCTCCCCTGTTGCACAAAACAGGACAC 
ATGCTGGGATTGCAGAAAAGC!AAT A A ATrtPTrsr 1 a r» a pptp * » a 

AAACTATTCAAGGACCCTGGCCAAGTCACAGGCTACCTGTGGCC 
CTGAGGGGACAGCTCATGGGTTGGCATTAGGGGAAGCAGCTCTC 
AAGGGGCCTGTATCCTGGGGATTCAACTCTGTGCCTATGTGGCA 
TTGAGCCTGTGTGAATGTGGTGACTGTCATGCTGTTTTGCTGTG 
TGTGCGTCTGCATGCCTGTGTGTTTGTGTGTCTCTCCACCTTCG 
TGGGGGGCAACTGTAGGTGTATTATGAGCCTTGGGTCTGTCTGT 

GTGTACAATAGCAATGTCTGTGCGGACTTAAGGACCTGCGCCCA 
TATGTTTGTGGGACTTTC 


SEQID 
NO. 254 


KCP_3UTR 
3_398605 


CAGAGAAGG C AGGAGGGAAGCAGGGACCAGC CCCTGCTC C ATC T 
TCCAGGATTCCAGGCCTCCCTGGCCTGGACAAGCCCTGAGCTGG 
CAGTTAGGAGAG C AG AGGTTGTGAATCTGGTGGGAC CCCCAGC A 
GGTCTTTCTGGCTCAGTGCCCTCATCTGTGAGCAGGGGTTCCCC 
AGGAGACCACGACAGAGGCCTGGAACCCAAGTTCTAATCCCACA 
TCCTGGCTGGGCAACTTCAGGCAAATTTCTAACACAAGGTAAGC 
CTCAATTTCTCTCTGGGGTAATGATCAGGCACCTGCTTAATTCA 
CAGGGGTTTGGTGGGCATCACGTGGACAATGTGGTTGCACAGCA 
GTGGGCAATGCAAAGGAAAGGAAGTATGTTAGTAAGTGCCCCTC 
C CCTGTTGC ACAAAAC AGGACACATGCTGGGATTGCAGAAAAG C 
AATAAATGCTGCA [C/T] AGGTGAAGAAAACTATTCAAGGACC C 
TGGCCAAGTCACAGGCTACCTGTGGCCCTGAGGGGACAGCTCAT 
GGGTTGGCATTAGGGGAAGCAGCTCTCAAGGGGCCTGTATCCTG 
GGGATTCAACTCTGTGCCTATGTGGCATTGAGCCTGTGTGAATG 
TGGTGACTGTCATGCTGTTTTGCTGTGTGTGCGTCTGCATGCCT 
GTGTGTTTGTGTGTCTCTCCACCTTCGTGGGGGGCAACTGTAGG 
TGTATTATGAGCCTTGGGTCTGTCTGTGTGTACAATAGCAATGT 


SEQID 
NO. 255 



SNSDOCID: <WO__2004041 193A2J_> 



WO 2004/041193 



PCT/US2003/034681 



-180- 





CTGTGCGGACTTAAGGACCTGCGCCCATATGTTTGTGGGACTTT 
CTGGGCATGCATGCTTGTTTATGAGGCCATACATCCGGGTATTC 
TGTGAACTGCTAGCATGGTGTGTATCTGTGTGGCAGACAGAAAA 
TGGCTGGGTGGGA 




KCP_elb_ 
3 99912 


ATCTCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACCTGAGGT 
CAGGAGTTCAAGCCCAGCCAGCCCAACATGGCGAAACCCCGTCT 
CTATTAAAAAATACAAAAAAATTTAGCTGGGCCTAGTGGTGGGC 
GCCTGTAATCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATCGC 
TTGAATCTGGGAGGCAGAGGTTGCAGTGAGCAGAGATCGCACCA 
CTGCACTCCAGCCTGGGCAACAGAGCGAGACTCCGTCTCAAAAA 
AAAAAAAAAAAGAAAAAGAAAAATGAGAGTGTAAGGGCCCAGAG 
GGGCTGAGGGCTCCTTTCTCCTCCCCAACTCCCTGTCACTAGAA 
GGTGGGCCCTGCCATAGGAGGATTCTGCAGAACCCTCAAGGACC 
CGCGGAGCAGGACGGCACCTTCTTCCCATGACCACCCATTTGGA 
TGTGTTTTTCACCCCTTTCTGGGTGGGGCAGACTTTCCCCCTCC 
CCATGAGTTCAGGCAG [G/T] GGGTTAAATAAGATTTC C CTTGA 
AGTCGAATGAAATCACAATGCACCACACACAGGGACACACACAC 
ACACACACGCACGCACGCACATCACACACACACACACACA.CACA 
CACACACACACACACACATACACACACACAGTCTCCCTGGGGCC 
AATCTACTGCCGCCTGAACCTCACCCATCAGCCAGGTGCCTGGC 
CCCGGGTCTGTCTCTTAGGGTTACATGCTCCCGGGGCTCCCGCA 
CATACCCCGGCAGATGAGGGTGCGCAGGGGTGAGGGCGCAGGGC 
TGGGCGTCCCCCGCCCCCACCGTGCAGCCCTCGCCCCCGCCCCG 
CCCCTCCGTAGTTGCCCGCCCGCCGCCCCCTCCGCCGCCCCCTC 
CGCCGCTCCGACTCTCGCCCCGAGCGCTGGCAGCAGGCAGCAGG 
CAGCAGGCGGGCGCGCTGTGGCTCCGCGCCGCGCGGTCCGGGCT 
CTGTTCATTCATGATTGGTACTCGGCCCTCCGAGACC 


SEQID 
NO. 256 


rsl02685 


AGCACTCCTGGGGCTCATTGTTAAGTTTATAAAACTCAGAGCTG 
ATGAGTTGTGTGCACTGTGTGGGTCTGAGTGGGCTTATGACTCC 
C CTCCAAGC CTGGCTGTAAGAATCTAAGACTTAAAGCTGAAGGA 
CCAAATGGGACTTTCTGTCCCATCCCCTCTCTGCTCCATGCAAG 
CACCAA [C/T] GTGGATTTTTGCCCCTAATTATATTAGGGAACG 
CTGTCAATCAAAAAGATGATGTTAAACTGATC CAGAACAAACC A 
AACCATGTTTAAGGGGAAGAAAAGATTACATCTTCAAATGCCAG 
CATGCCATCATTAATACAATGTCTAATGTAGTCAATATAGTTCA 
GGCAACATTGAAAATGAACCACTGCAAATACTAGGAATACAATT 
TCAAGAGGAAGCACAACATTCTGTGTTTCTATGCACACAGTCCT 
GTAAATTATTTGCAGCTCAAGTATGTCATGTTnTTTTAAATTTT 
CCCCTGGGTACAGCTTGAACAACTTCGTACAAGTGTTGATATGT 
CATATTCTCATTATCATTTAGTTCAAAATTACCATGATTTAATT 
ACCATGAGGTTGCTTTTTTGATACATGAGTTACTTAGAAATTGA 
ATTAggctaggcatggtggcttccacctataatcctagcacttt 
ggaaggccaaggcaggaggattgcttgagtttgaggccagtcta 
ggcaatatagtgagacctcatctccccaaaagtacaaaaaaact 
agccaggcatggggacacatgcctatagttccagctactcaaag 
gctgaggtggggaggattgctttgagcctggg 


SEQID 
NO. 257 


rs905808 


GCCAGCTATCCCCAGAGACATCACAGGAGAAGGAGCAGAAGCTG 
GAACATCATC CGGGAG CTGGACTAGAACGTCC CGGGAAACTTC A 
GCCTGGCTTCTGCTTTGTCCCGAAAACCCAGGGGCTCCAGCTCC 
AGGGCTGTGTCTTAGAATGAGGCAGTTTATCTGTTCAGGGCTTC 
TCTTAGTTTTTAATCC CAATAGGAC ACA [C/T] GTTGTATTAAA 
AAGCCATGCGAGATGGAAGAAGGAAATTGAATGAAATTTGAGGG 
CAGG TAGGAG CAG AG A CAATAAATAATT C AGCAG TGAAGGAAG C 
AGAAAAAAGATTGCACTCATTTCGCCCTTCAACAATTATACTAA 
ACACCTGCTCTGGGCCACAGAAGGGCCAGATCCCATTCCTGTGC 
TCAGGAAGCCCACAGGCCGGCAGGGAGAGGCTGGTTGGAATGTG 
TGCTTTGCACTGTAACGGAGGCATCGAGCATGGTAAGGGACTGG 


SEQID 
NO. 258 



8NSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 



PCT/US2003/034681 



-181- 




Tablel 1 . The Build 33 location of SNPs and microsatellites employed for the &st- 
pass association analysis across KChlPl. 

5 



Start (B33) 


Marker 


Public 
Alias 


HePODF al-iac 


v an an on 


169788696. 


DG5S47 








169794522 


DG5S1592 








169843903 


DG5S119 








16986QR4S 


rS7jjujo 


rsyojojo 


DCjOOAAFCS 


A/G 


1 69R699SS 


rszo J7U7 1 


rszjjyuy 1 


DG00AAFCI 


G/T 


169961410 










169964087 




rsyi/DoUo 


IXjOOAAFCG 


C/T 


170006645 


rs883849 


rs883849 


DG00AAFCK, 
DG00AAIOG 


A/G 


170015858 


DG5S123 










rc?1 3^04^ 
I fc>Z Ijj U^fO. 


rszJo_>U4o 


DG00AAFCJ, 

r\nAA A A T/^TT 

DtiOOAAlOH 


C/T 




J-xvJ .J O 1 








170056955 


I oZ. J> J7 kDZr 


rszj 1 


JJLrUUAArCR 


A/G 


170064881 


r«532946R 

1 OJA.7^UO 




T^riAH A A T7^~ , 1LT 

IJutUUAArCii 


A/G 


170070041 


rs50057 


rs50057 


DG00AAFCF, 
DG00AAIOI 


A/G 


170070735 


rsl 02685 


rsl02685 


DG00AAFCE, 
DG00AAIOJ 


C/T 


170073252 


rs50364 


rs50364 


DG00AAFCD 


A/G 


170081291 


KCP 1152 




SG05S176 


C/T 


170081473 


KCP 1333 




SG05S921 


A/G 


170082788 


KCP 2649 




SG05S923 


C/T 


170085115 


KCP 4976 




DG00AAGHK, 
DG00AAHUT, 
DG00AAINX 


C/T 


170085217 


KCP 5077 




DG00AAINZ 


A/T ! 


170095540 


KCP 15400 




SG05S946 


C/T ! 


170096291 


KCP 16152 


rs486818 


SG05S948 


A/G 


170098209 


KCP 18069 


rsl363712 


SG05S189 


C/T 


170105556 


D5S625 









.2004041 193A2_L> 
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-182- 
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Table 12. Hie Build 33 location of SNPs found through sequencing across 



Build 33 
Pos 


Project 
Pos 


DECODE 
ALIAS 


SEQ 
PROJECT 
ALIAS 


PUBLIC 
ALIAS 


SNP 


169866787 


9677 


DG00AAHAF 


KCP 9677 




C/G 


169867465- 


10355 


SG05S229 


KCP 10355 




A/T 


169867556 


10446 


DG00AAHA 
R 


KCP 10446 




C/G 


169871957 


14847 


SG05S485 


KCP 14847 


rs486768 


A/T 


169872129 


15019 


SG05S1298 


KCP 15019 


rs4867973 


A/G 


169872417 


15307 


SG05S437 


KCP 15307 




A/C 


169872421 


15311 


SG05S438 


KCP 15311 




A/T 


169872435 


15325 


SG05S439 


KCP_15325 




C/G 


169872949 


15839 


SG05S440 


KCP 15839 




A/G 


169873539 


16429 


SG05S486 


KCP 16429 




C/T 


169873680 


16570 


SG05S487 


KCP_16570 




A/G 


169875123 


18013 


SG05S488 


KCP 18013 




A/T 


169875568 


18458 


SG05S1002 


KCP 18458 




A/G 


169876302 


19192 


SG05S489 


KCP 19192 




A/G 


169878365 


21255 


SG05S490 


KCP 21255 




G/T 


169878734 


21624 


SG05S491 


KCP_21624 


rs486769 


A/G 


169879678 


22568 


SG05S492 


KCP 22568 




A/C 


169879717 


22607 


SG05S493 


KCP 22607 


• 


C/T 


169881496 


24386 


SG05S494 


KCP 24386 




A/G 


169882681 


25571 


SG05S495 


KCP 25571 




A7C 


169883265 


26155 


SG05S496 


KCP_26155 




A/G 






b(j05S497 


KCP 26223 




C/G 


169883413 


26303 


SG05S498 


KCP 26303 




A/G 


169883465 


26355 , 


3G05S1171 


KCP_26355 


I 


C/G . 


169883518 


26408 ! 


3G05S499 


KCP_26408 


i 


A/T 


169883738 : 


26628 ! 


3G05S500 


K.CP 26628 


i 


A/G 


169883811 : 


26701 ! 


3G05S501 


ECCP 26701 


1 


A'G 


169884084 : 


26974 ! 


3G05S1172 


fCCP 26974 


( 


:/t 


169884145 : 


27035 ! 


3G05S502 


K:CP_27035 


( 


3/T 


169884439 : 


27329 ! 


3G05S503 


KCP 27329 


( 


:/t 


169884682 : 


27572 ! 


3G05S504 


£CP 27572 




A/G 



.20040411 93 A2_l_> 



WO 2004/041193 



-183- 
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169884707 


27597 


DG00AATHT 


KCP 97S97 




rV/VJ 


169884973 


27863 


SG05S5O5 


PTPP 97R61 




a in 

n/U 


169885005 

JL \J *S \J U V \S -J 


27895 


SG05S506 


KX^P 97R9^ 






169888453 


31343 


SG05S507 


KCT> 11141 

XVV-'X J 1 JTJ 


rSHOO ly Ij 


\J 1 


169889433 


32323 




l^Pp 10191 
xV^x jZjZj 




U/ 1 


169889680 


32570 




TTPP 19^70 




a 


169890025 


3291 5 




xV^x jZ7 ID 




a /n 
A/(jr 


169890055 

X v uy VU J _J 


39945 


^O0S<s1 1 71 


irr'P 1704^ 




A //^ 


169890089 


19979 


QflO^ CI 1 74 

0\JU «70 1 1 /*+ 


iT'r^p 0O070 




A //""* 


1 69890991 

X \J O y\J^.y X 


111 R1 


COO^C^IO 


xV^Jr Jjiol 




A/Cjt 


1 698991 99 


1^01 9 


Qf^KQ 1 1 7^ 


IS-^Jr 3 3 U I Z 




A //^ 


1 6989233? 


1^999 


OUuJOJ 1 J. 






A //^ 

A/Cj 


169899574 


1S414 


<sG0^C61 


xS-V^Jr jjtIt 




/-« /TP 


1 69R9961 Q 


1SSOQ 

jDj\Jy 


COO^C^I 9 


17" /^P QCC/^Q 

xV^Jr JDDUy 




C/T 


1 698996R7 


1^77 


^r^o^^s^i i 

OUUJOJlJ 


IVl^Jr jjj// 




G/T 


1 698911 S7 




coo^c^i 4 


tv^x OOU4 / 




C/T 


169891169 


160SQ 


CfTO<C*;i ^ 
puUjjj x D 


jx.vr_jOUj y 




C/T 


1 69891871 


16761 

-?U / VJ JL 


jUUjojIO 


iv^x^ JO /Ox 




A/C 


169894061 


36951 

•JVJ^.7 1 


<sG0^^17 


ivv^x JOi7D 1 




A //^» 


1 698941SR 


1794R 


Cf^O^C^I S 
ovjrV j 1 o 


xVv^Jr_j) / 24 o 




C/G 


1 69R9SS07 


1R1Q7 


Cf^n^Cl 1 If. 
oUUjo 1 1 / O 


xvCjt oojy / 




O /HP i 

C/T 


1 6989S699 

X \J Z7 O y J VJ _7 .7 


JOJ07 


O VJU -> O -7 .7 3 






A/C 


1 69R96199 


1Q919 




IvL/Jr DyllZ 




G/T 


1 698961S7 


10947 


CnO*\C^90 


ssX^r DyZ*+f 




A/G 


1 69R9616Q 






fsX^r 5yZjy 




A/G 


1 69896451 


19141 


cno^ci 1 77 

DVJU.70 XX// 


xVL^i^ jyj4l 




A //— < 

A/G 




19S17' 




irr^P io^^'7 




C/l 


169896750 

X \J Is kj ^? \j / V7 


19640 


QflOSC^91 






A/1 


169896914 


19804 

J70VT 


^(^^^^94 

JUVJuJ-T 


KT^P 1QS04 
tW-'Jr_j 7oUt 




A/G 


169897484 


40374 




KTP A0174 




pi /rp 

C/ 1 


169897594 


40484 


^O0S9^96 


W^x HUtOt 




a /n 


169897621 


4051 1 

*J X X 


^r^0^^S97 






C/ 1 


169897856 


40746 

r v / Tu 


^rrOS*S^9R 
jVJvJOJZO 


K'PP 40746 
V^x tU/tO i 




C/ 1 


169898205 


41095 


SG05S529 


KCP 41095 ! 




C/T 


169898252 


41142 


SG05S530 


BCCP 41142 


( 


C/T 


169898371 


41261 


SG05S531 


K:CP_41261 




VG 


169899446 


42336 


SG05S532 


K1CP_42336 


j 


A/G 


169899693 


42583 


SG05S533 ] 


ECCP_42583 




A7G 


169900156 


43046 


SG05S534 ] 


BCCP 43046 




VG 
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169900425 


43315 


SG05S1178 


KCP 43315 




C/Cr 


169900629 


43519 


SG05S535 


BCCP 43519 

*-VVl "T J J 1 7 




C/T 


169902212 


45102 


SG05S536 


ECCP 45109 


rc91 1961 


A/fr 

n/VJ 


1 69902400 


45290 


SG05S537 


KCP 45290 




fr/T 
vJT/ 1 


169903206 

-L \J m/ ~S \J -J \J \J 


46096 


9G05953R 


KfT 460Q6 




L^/ 1 


169903615 


46505 


9G059539 


KCP 46505 

xvV^x *tU JUJ 






169903676 


46566 


.90059540 

LJ VJ V7 J O -7 H \7 


KCP 4656£ 

xW^JT HDD OO 






169903766 


46656 


90059541 

OVJVJuJt x 


IvV^i H-OOjO 






169904530 


47490 


90059549 

OVJV Jo .JH^ 


ITOP 47A90 




u/ 1 


169904757 


H / 17H / 


90059543 

»J VJV-7 O ->*t«J 


JSA^r *t l Oh- / 




A /Pi 


1 69906262 


4Q1 5? 


900591 1 70 


PTOP 4Q1 ^9 
IVv^x Hl7 IjZ 




A /fir 


169906576 


40466 


90059544 

O VJU J O «7 HH 






A /Pi 


1 69906R46 


40736 

Hj? / JO 


90059545 


iVv-^x Hi7 / jO 




A /T 


1 69907R66 


50756 

-7U / JL> 


9pV"K9 1 1 QO 


xVV^Jr DU/DO 




A /Pi 
A/Lj 


16990R937 

J. \70_7«7 / 


51R97 


900591 1 R1 

OLTUJOllOl 








169909190 


520R0 


900591 1 R9 


TTOP ^90R0 




PVT 
U/ 1 


169910099 


529R9 


90059546 

OVJV/JOJ'ru 






A /Pi 


169910133 




9G059547 

OUUJOJH' / 


xVUr _ D D UZj 




U/ 1 


169911784 

X \J S ~f AX/ 


54674 


9O05954R 

lj yJ\J J O J H O 


iV\^Jr jhO / H* 




a /r* 

A7 


169911823 


54713 

J*T / X.7 


9O05954Q 


xX»v^x J*t / 1 J 




A /P* 


1699130R6 


55976 


900591 1 R3 


JLVv^x JJ7 / □ 




A /Pi 


16991341 5 

X V.7 .7 U*T X J 


56305 


90059^9 


ivv^Jr jOjUj 




A /Pi 


169913670 

X V/.7 UVJ / V/ 


56560 


9O059Q54 


xVv^xr jOjOU 




p'/nr 


1 6991 30RR 

1 \JZ7Zs 1J70O 


56R7R 


900S9^^0 

OVJTUJO JJU 


xXAwir jOo/o 




U/Cj 


169914731 


57691 


9c^0S95^1 


Ifpp ^7^7 1 
iVv^Ir D / OZ 1 




A //^i 
A/Ur 


169914RR7 

Jl U771*TOO / 


57777 


9O059559 


krfP ^7777 




A /Pi. 


169915597 

X \J ~S X J J .7 / 


5R4R7 

.7 OtO / 


90059553 

OVJvJO JJJ 


rpp ^R/La7 

iVv^x J OHO / 




A /Pi 
A/O 


169917130 

iv-/y l / X J VJ 


60020 

17 V7\JZ.\7 


9O059554 

OVJUJOJJ*t 


?TP £0090 






169917579 


60469 


9G05S555 


KCP 60460 




a /p; 


169917813 


60703 


SG05S556 


CCP 60703 




A /P^- 
A/VJ 


169919906 


62096 


SG05S557 


CCP 62096 




A/vjr 


169919909 


62799 


SG05S233 


CCP 62790 

LV^V_^X UX» t S ^ 




n/T 
0/ ± 


169921008 


63898 


SG05S55S 


KCP 63R9R 




a /Pi 
Ay vj 


169921407 


64297 


SG05S559 


KLCP 64297 




A/G 


169921917 


64807 


SG05S560 


K1CP_64807 




3/T 


169922010 


64900 


SG05S1184 


K1CP 64900 I 




A7G 


169922309 


65199 


SG05S955 


K1CP_65199 




VG 


169922397 


65287 


SG05S561 


fCCP 65287 




3/T ! 


169923449 


66339 


SG05S562 


K1CP_66339 




VG 
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. PCT/US20O3/034681 
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169923611 


66501 


SG05S563 


KCP_66501 




A/G 


169924005 


66895 


SG05S564 


KCP_66895 




A/G 


169925422 


68312 


SG05S956 


KCP 68312 




A/C 


169926039 


68929 


SG05S565 


KCP_68929 




C/T 


169926454 


69344 


SG05S566 


KCP 69344 




A/G 


169926756 


69646 


SG05S567 


KCP 69646 




C/T 


169927013 


69903 


SGOSS568 


KCP 69903 




A/G 


169927893 


70783 


SG05S569 


KCP 70783 




P/T 


169928063 


70953 


SG05S570 


KCP 70953 




A/T 


169928076 


70966 


SG05S571 


KCP 70966 






169928444 


71334 


SG05S572 


KCP 71334 




P/T 


169928522 


71412 


SG05S573 


KCP 71412 




A/T 

r\J x 


169928555 


71445 


SG05S1185 


KCP 71445 




P/T 

v-»/ JL 


169928665 

X w *r *S ±* \S V/ 


71555 


SG05S1186 


KCP 71555 




P/T 
v-»/ JL 


169928700 


71590 


SG05S1187 


KCP 715Q0 




P/T 

W X 


169929635 


72525 


SG05S574 


KCP 72S2S 


rQ49^0?07 


A/O 
n/vJ 


169929849 


72739 


SG05S575 


KCP 72739 




P/O 


169930171 


73061 


SG05S576 


KCP 73061 

XW^X / V/ VJ X 




P/T 


169930506 


73396 


SG05S577 


KCP 73396 




A/T 


169930538 


73428 


SG05S578 


KCP 734? R 


ibT-oo / y / o 




169930644 


73534 


SG05S579 


KCP 73*534 




P/T 


169931073 


73963 


SG05S580 






CI (It 


169931425 


74315 


SG05S581 


KCP 743 1 5 




a/p; 


169931663 


74553 


SG05S582 


KCP 74SS3 




Lj/ 1 


169931670 


74560 


SG05S583 


KCP 74560 




P/T 


169932137 


75027 


SG05S584 


KCP 75027~ 




P/T 


169932696 


75586 


SG05S585 


KCP 75586 






169932998 


75888 


SG05S586 


KCP 75888 




P/T 
X 


169933181 


76071 


SG05S587 


KCP 76071 


L OvJ / JO 


a/p; 


169933212 


76102 


SG05S588 


KCP 76102 


rs38675Q 


P/T 
o/ x 


169933256 


76146 


SG05S589 


KCP 76146 




A/Pr 


169933389 


76279 


SG05S1188 


KCP 76279 


rq43fi874^ ( 

LO*TJ \JO / *tv,l ' 


P/T 
o/ X 


169933420 


76310 


SG05S590 


KCP 76310 


< 


P/T 

O/ X 


169933699 


76589 


SG05S591 


KCP_76589 




C/T 


169933756 


76646 


SG05S592 


KCP_76646 


( 


C/T 


169934348 


77238 


SG05S593 


KCP_77238 


1 


3/T 


169934429 


77319 


SG05S594 


KCP 77319 


1 


C/G 


169934556 


77446 


SG05S595 


KCPJ77446 


1 


C/T 


169934663 


77553 


SG05S596 


KCP__77553 


( 


C/T 
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169934751 


77641 


SG05S597 


KCP 77641 


rs4242157 


A/G 


169934936 


77826 


SG05S598 


KCP 77826 




C/G 


169934949 


77839 


SG05S599 


KCP 77839 




A/G 


169935134 


78024 


SG05S600 


KCP 78024 


rs4867981 


A/G 


169935240 


78130 


SG05S601 


KCP_78130 


rs4867614 


C/T 


169935254 


78144 


SG05S602 


KCP 78144 




A/C 


169935713 


78603 


SGO5S603 


KCP_78603 




C/T 


169935892 


78782 


SG05S604 


KCP_78782 




A/G 


169935939 


78829 


SGO5S605 


KCP_78829 




A/G 


169935989 


78879 


SG05S606 


KCP 78879 




C/T 


169936272 


79162 


SG05S607 


KCP 79162 




C/T 


169936275 


79165 


SG05S608 


KCP 79165 




C/T 


169936329 


79219 


5G05S609 


KCP 79219 




G/T 


169936495 


79385 


SG05S610 


KCP 79385 




C/T 


169936910 


79800 


SG05S611 


KCP_79800 




C/G 


169937029 


79919 


SG05S1189 


KCP 79919 




A/G 


169937270 


80160 


SG05S612 


KCP 80160 




A/G 


169937896 


80786 


SG05S613 


KCP 80786 




A/G 


169938126 


81016 


SG05S614 


KCP 81016 




C/T 


169938400 


81290 


SG05S615 


KCP 81290 




A/G 


169938894 


81784 


SG05S1190 


KCP 817S4 




A/G 


169939578 


82468, 


SG05S957 


KCP 82468 


rs4242158 


A/G 


169940311 


83201 


SG05S616 


KCP_83201 




C/T 


169940995 


83885 


SG05S617 


KCP 83885 




A/G 


169941106 


83996 


SG05S618 


KCP_83996 


rs4867615 


A/G 


169941897 


84787 


SG05S1191 


KCP_84787 




A/T 


169942667 


85557 


SG05S619 


KCP 85557 




A/G 


169942775 


85665 


SG05S620 


KCP_85665 




C/T 


169942903 


85793 


SG05S958 


KCP_85793 




C/T 


169943046 


85936 


SG05S621 


KCP 85936 




A/G 


169943817 


86707 


SG05S622 


KCP_86707 




A/T 


169944237 


87127 


SG05S623 


KCP_87127 




C/G 


169945487 


88377 


SG05S624 


KCP 88377 




C/T 


169945857 


88747 


SG05S625 


KCP 88747 




A/T 


169945886 


88776 


SG05S626 


KCP_88776 




C/T 


169945923 


88813 


SG05S627 


KCP_88813 




A/G 


169946380 


89270 


SG05S628 


KCP_89270 




A/G | 


169946491 


89381 


SG05S629 


KCP_89381 


rs4867983 


A/G ! 


169947228 


90118 


SGO5S630 


KCP_90118 ! 




A/G 
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-187- 
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169947236 


90126 


SG05S631 


KCP 90126 




G/T 


169947285 


90175 


SG05S632 


KCP 90175 




C/T 


1699 47 '471 


90361 


SG05S633 


KCP 90361 




C/G 


169947529 


90419 


SG05S634 


KCP 90419 




C/T 


169947661 


90551 


SG05S635 


KCP 90551 




A/G 


169947834 


90724 


SG05S636 


KCP 90724 




A/G 


169948187 


91077 


SG05S637 


KCP 91077 




A/G 


169948683 


91573 


SG05S1192 


KCP_91573 




A/G 


169948703 


91593 


SG05S1193 


KCP_91593 




G/T 


169948722 


91612 


SG05S1194 


KCP_91612 




A/G 


169948755 


91645 


SG05S1195 


KCP_91645 




C/T 


169948788 


91678 


SG05S1196 


KCP_91678 




A/G 


169948798 


91688 


SG05S1197 


KCP 91688 




C/T 


169948977 


91867 


SG05S638 


KCP_91867 




C/T 


169949063 


91953 


SG05S639 


KCP_91953 




C/T 


169949229 


92119 


SG05S640 


KCP 92119 




C/T 
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Table 13. The Build 33 location of SNPs and microsatellites employed for the 
subsequent association analysis across KChlPl. 
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reference in their entirety. While this invention has been particularly shown and 
described with references to preferred embodiments thereof, it will be understood by 
. those skilled in the art that various changes in form and details may be made therein 
5 without departing from the scope of the invention encompassed by the appended 
claims. 
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CLAIMS 



What is claimed is: 



10 



A method of diagnosing a susceptibility to Type II diabetes in an individual, 
comprising detecting a polymorphism in a KChlPl nucleic acid, wherein the 
presence of the polymorphism in the nucleic acid is indicative of a 
susceptibility to Type II diabetes. 



2. A method of diagnosing a susceptibility to Type II diabetes comprising 
detecting an alteration in the expression or composition of a polypeptide 
encoded by KChlPl nucleic acid in a test sample, in comparison with the 
expression or composition of a polypeptide encoded by a KChlPl nucleic acid 

15 in a control sample, wherein the presence of an alteration in expression or 

composition of the polypeptide in the test sample is indicative of a 
susceptibility to Type II diabetes. 

3. The method of Claim 1, wherein the polymorphism in the KChlPlnucleic acid 
20 is indicated by detecting the presence of a least one of the polymorphisms 

indicated in Table 13. 

4. An isolated nucleic acid molecule comprising a KChlPl nucleic acid, wherein 
the KChlPl nucleic acid has a nucleotide sequence selected from the group of 

25 nucleic acid sequences as shown in Table 10, or the complements of the group 

of nucleic acid sequences as shown in Table 10, wherein the nucleotide . 
sequence contains a polymorphism. 

5. An isolated nucleic acid molecule which hybridizes under high stringency 
30 conditions to a nucleotide sequence selected from the group of nucleic acid 

sequences as shown in Table 10, or the complements of the group of nucleic 
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acid sequences as shown in Table 10, wherein the nucleotide sequence 
contains a polymorphism. 

6. A method for assaying for the presence of a first nucleic acid molecule in a 
5 sample, comprising contacting said sample with a second nucleic acid 

molecule, where the second nucleic acid molecule comprises a nucleotide 
sequence selected from the group consisting of: nucleic acid sequences as 
shown in Table 10 and the complement of the nucleic acid sequences as 
shown in Table 10, wherein the nucleotide sequence contains a polymorphism 
10 and hybridizes to the first nucleic acid under high stringency conditions. 

7. A vector comprising an isolated nucleic acid molecule selected from the group 
consisting of: 

a) nucleic acid sequences as shown in Table 10; and 
15 b) complement of one of the nucleic acid sequences are shown in 

Table 10; and 

wherein the nucleic acid molecule contains a polymorphism and is 
operably linked to a regulatory sequence. 

20 8. A recombinant host cell comprising the vector of Claim 7. 

9. A method for producing a polypeptide encoded by an isolated nucleic acid 
molecule having a polymorphism, comprising culturing the recombinant host 
cell of Claim 10 under conditions suitable for expression of the nucleic acid 

25 molecule. 

10. A method of assaying for the presence of a polypeptide encoded by an isolated 
nucleic acid molecule according to Claim 4 in a sample, the method 
comprising contacting the sample with an antibody which specifically binds to 

30 the encoded polypeptide. 
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11. A method of identifying an agent that alters expression of a KCHIP lnucleic 
acid, comprising: 

a) contacting a solution containing a nucleic acid comprising 

the promoter region of the KCHIP l nucleic acid 
5 operably linked to a reporter gene with an agent to be 

tested; 

b) assessing the level of expression of the reporter gene; and 

c) comparing the level of expression with a level of expression of the 

reporter gene in the absence of the agent; wherein if the level of 
1° expression of the reporter gene in the presence of the agent 

differs, by an amount that is statistically significant, from the 
level of expression in the absence of the agent, then the agent is 
an agent that alters expression of the KCHIP 1 nucleic acid. 

15 

12. An agent that alters expression of the KCHIP lnucleic acid, identifiable 
according to the method of Claim 11. 

13. A method of identifying an agent that alters expression of a KCHIP 1 nucleic 
20 acid, comprising: 

a) contacting a solution containing a nucleic acid of Claim 
1 or a derivative or fragment thereof with an agent to be 
tested; 

b) comparing expression with expression of the nucleic acid, 

25 derivative 

or fragment in the absence of the agent; 
wherein if expression of the nucleotide, derivative or fragment in the 
presence of the agent differs, by an amount that is statistically 
significant, from the expression in the absence of the agent, then the 

30 agent is an agent that alters expression of the KCHIP lnucleic acid. 



NSDOCID: <WO 2004041 1 93A2_I_> 



WO 2004/041193 PCT/US2003/034681 

-209- 

14. The method of Claim 13, wherein the expression of the nucleotide, derivative 
or fragment in the presence of the agent comprises expression of one or more 
splicing variant(s) that differ in kind or in quantity from the expression of one 
or more splicing variant(s) the absence of the agent. 

5 . 

15. An agent that alters expression of a KChlPlnucleic acid, identifiable 
according to the method of Claim 14. 

16. An agent that alters expression of a KChlPl nucleic acid, selected from the 
10 group consisting of: antisense nucleic acid to a KChlPl nucleic acid; a 

KChlPl polypeptide; a KChlPl nucleic acid receptor; a KChlPl binding 
agent; a peptidomimetic; a fusion protein; a prodrug thereof; an antibody; and 
a ribozyme. 

15 17. A method of altering expression of a KChlPl nucleic acid, comprising 

contacting a cell containing a KChlPl nucleic acid with an agent of Claim 16. 

18. A method of identifying a polypeptide which interacts with a KChlPl 
polypeptide comprising a polymorphism indicated in Table 13, comprising 

20 employing a yeast two-hybrid system using a first vector which comprises a 

nucleic acid encoding a DNA binding domain and a KChIP 1 polypeptide, 
splicing variant, or a fragment or derivative thereof, and a second vector which 
comprises a nucleic acid encoding a transcription activation domain and a 
nucleic acid encoding a test polypeptide, wherein if transcriptional activation 

25 occurs in the yeast two-hybrid system, the test polypeptide is a polypeptide 

which interacts with a KChIP 1 polypeptide. 

19. A Type II diabetes therapeutic agent selected from the group consisting of: a 
KChlPl nucleic acid or fragment or derivative thereof; a polypeptide encoded 

30 by a KChlPl nucleic acid; a KChlPl receptor; a KChlPl nucleic acid binding 

agent; a peptidomimetic; a fusion protein; a prodrug; an antibody; an agent 
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that alters KChlPl nucleic acid expression; an agent that alters activity of a 
polypeptide encoded by a KChlPl nucleic acid; an agent that alters 
posttranscriptional processing of a polypeptide encoded by a KChlPl nucleic 
acid; an agent that alters interaction of a KChlPl nucleic acid with a KChlPl 
5 binding agent; an agent that alters transcription of splicing variants encoded by 

a KChlPl nucleic acid; and a ribozyme. 

20. A pharmaceutical composition comprising a Type II diabetes therapeutic agent 
of Claim 19. 

10 

2 1 . The pharmaceutical composition of Claim 20, wherein the Type II diabetes 
therapeutic agent is an isolated nucleic acid molecule comprising a KChlPl 
nucleic acid or fragment or derivative thereof. 



15 22. The pharmaceutical composition of Claim 20, wherein the Type H diabetes 
' therapeutic agent is a polypeptide encoded by the KChlPl nucleic acid. 

23. A method of treating a disease or condition associated with KChlPl in an 
individual, comprising administering a Type II diabetes therapeutic agent to 

20 the individual, in a therapeutically effective amount. 

24. The method of Claim 23, wherein the Type II diabetes therapeutic agent is a 
KChlPl nucleic acid agonist. 

25 25. The method of Claim 23 wherein the Type II diabetes therapeutic agent is a 
KChlPl nucleic acid antagonist. 

26. A transgenic animal comprising a nucleic acid selected from the group 

consisting of: an exogenous KChlPl nucleic acid and a nucleic acid encoding 
30 a KChlPl polypeptide. 
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27. A method for assaying a sample for the presence of a KChIP 1 nucleic acid, 
comprising: 

a) contacting said sample with a nucleic acid comprising a 
contiguous nucleotide sequence which is at least partially 

5 complementary to a part of the sequence of said KChlPlgene 

under conditions appropriate for hybridization, and 

b) assessing whether hybridization has occurred between a 
KChlPlgene nucleic acid and said nucleic acid comprising a 
contiguous nucleotide sequence which is at least partially 

10 complementary to a part of the sequence of said KChlPl nucleic 

acid; 

wherein if hybridization has occurred, a KChlPl nucleic acid is present 
in the nucleic acid. 

15 28. The method of Claim 27, wherein said nucleic acid comprising a contiguous 
nucleotide sequence is completely complementary to a part of the sequence of 
said KChlPl nucleic acid. 

29. The method of Claim 27, further comprising amplification of at least part of 
20 said KChlPl nucleic acid. 

30. The method of Claim 27, wherein said contiguous nucleotide sequence is 100 
or fewer nucleotides in length and is either: a) at least 80% identical to a 
contiguous sequence of nucleotides in one of the nucleic acid sequences as 

25 shown in Table 10; b) at least 80% identical to the complement of a 

contiguous sequence of nucleotides in one of the nucleic acid sequences as 
shown in Table 10; or c) capable of selectively hybridizing to said KChlPl 
nucleic acid. 

30 31. A reagent for assaying a sample for the presence of a KChIP 1 nucleic acid, 
said reagent comprising a nucleic acid comprising a contiguous nucleotide 
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sequence which is at least partially complementary to a part of the nucleotide 
sequence of said KChJPl nucleic acid. ' 

32. The reagent of Claim 3 1 , wherein the nucleic acid comprises a contiguous 
5 nucleotide sequence, which is completely complementary to a part of the 

nucleotide sequence of said KChlPl nucleic acid. 

33. A reagent kit for assaying a sample for the presence of a KChDP 1 nucleic acid, 
comprising in separate containers: 

10 a) one or more labeled nucleic acids comprising a contiguous 

nucleotide 

sequence which is at least partially complementary to a part of the 
nucleotide sequence of said KChIP lnucleic acid, and 
% b) reagents for detection of said label. 

15 

34. The reagent kit of Claim 33, wherein the labeled nucleic acid comprises a 
contiguous nucleotide sequences which is completely complementary to a part 
of the nucleotide sequence of said KChJPl nucleic acid. 

20 35. A reagent kit for assaying a sample for the presence of a KChEP 1 nucleic acid, 
comprising one or more nucleic acids comprising a contiguous nucleic acid 
sequence which is at least partially complementary to a part of the nucleic acid 
sequence of said KChlPl nucleic acid, and which is capable of acting as a 
primer for said KChlPl nucleic acid when maintained under conditions for 

25 primer extension. 

36. The use of a nucleic acid which is 100 or fewer nucleotides in length and 
which is either: a) at least 80% identical to a contiguous sequence of 
nucleotides in one of the nucleic acid sequences as shown in Table 10; b) at 
30 least 80% identical to the complement of a contiguous sequence of nucleotides 

in one of the nucleic acid sequences as shown in Table 10; or c) capable of 
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selectively hybridizing to said KChlPl nucleic acid, for assaying a sample for 
the presence of a KChlPl nucleic acid. 

37. The use of a first nucleic acid which is 1 00 or fewer nucleotides in length and 
5 which is either: 

a) at least 80% identical to a contiguous sequence of nucleotides in 
one of the nucleic acid sequences as shown in Table 6; 

b) at least 80% identical to the complement of a contiguous 
sequence of nucleotides in one of the nucleic acid sequences as 

10 shown in Table 10; 

or 

c) capable of selectively hybridizing to said KChlPl nucleic acid; 
for assaying a sample for the presence of a KChlP 1 nucleic acid 
that has at least one nucleotide difference from the first nucleic 

15 acid. 

38. The use of a nucleic acid which is 100 or fewer nucleotides in length and 
which is either: 

a) at least 80% identical to a contiguous sequence of nucleotides in 
20 one of the nucleic acid sequences as shown in Table 10; 

b) at least 80% identical to the complement of a contiguous 
sequence of nucleotides in one of the nucleic acid sequences as 
shown in Table 10; 

or 

25 c) capable of selectively hybridizing to said KChlPl nucleic acid; 

for diagnosing a susceptibility to a disease or condition 
associated with a KChlPl . 

39. A method of diagnosing a susceptibility to Type H diabetes in an individual, 
30 comprising determining the presence or absence in the individual of a 

haplotype comprising a halotype shown in Table 2 or Table 5 at the 5q35 loci, 
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wherein the presence of the haplotype is diagnostic of susceptibility to Type II 
diabetes. 

40. The method of Claim 39, wherein determining the presence or absence of the 
5 haplotype comprises enzymatic amplification of nucleic acid from the 

individual. 

41. The method of claim 40, wherein determining the presence or absence of the 
haplotype further comprises electrophoretic analysis. 

10 

42. The method of claim 39, wherein determining the presence or absence of the 
haplotype further comprises restriction fragment length polymorphism 
analysis. 

15 43. The method of claim 39, wherein determining the presence or absence of the 
haplotype further comprises sequence analysis. 

44. A method of diagnosing a susceptibility to Type n diabetes in an individual, 
comprising: 

20 a ) obtaining a nucleic acid sample from said individual; and 

b) analyzing the nucleic acid sample for the presence or absence 
of a haplotype, comprising a haplotype shown in Table 2 or Table 5 
at the 5q35 loci comprising a KChlPlgene, wherein the presence of 
the haplotype is diagnostic for a susceptibility to Type II diabetes. 

25 

45. A method of diagnosing a susceptibility to Type n diabetes in an individual, 
comprising determining the presence or absence in the individual of a 
haplotype comprising one or more markers and/or single nucleotide 
polymorphisms as shown" in Table 13 in the locus on chromosome 5q35, 

30 wherein the presence of the haplotype is diagnostic of a susceptibility to Type 

II diabetes. 
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46. A method for the diagnosis and identification of a susceptibility to Type II 
diabetes in an individual, comprising: screening for an at-risk haplotype in the 
KChIP Inucleic acid that is more frequently present in an individual 
susceptible to Type II diabetes compared to an individual who is not 
susceptible to Type E diabetes wherein the at-risk haplotype increases the risk 
significantly. 

47. The method of Claim 46 wherein the significant increase is at least about 20%. 

48. The method of Claim 46 wherein the significant increase is identified as an 
odds ratio of at least about 1 .2. 



49. Use of a Type II diabetes therapeutic agent for the manufacture of a 

15 medicament for the treatment of a disease or condition associated with 

KCWPl in an individual. 

50. The use of Claim 49, wherein the Type H diabetes therapeutic agent is a 
KChlPl nucleic acid agonist. 



20 



51. The use of Clim 49, wherein the Tpe H diabetes therapeutic agent is a KChlPl 
antagonist. 



25 
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KchlPl genomic DNA (Build 33 Chr5: 169716198 - 170098453) 



caggagtggg 
atgcagaagc 
GGCTTCAGGG 

' CAAGTTCCTG 
GCAGAGCCTG 
AAGCTTGGGT 
GACCCTCAGC 
gatatggcct 
tggtgcaccc 
gaggttgctg 
agggaactca 
ggtgtgtggc 
gagcattgga 
gaaggttgta 
gtaattctgc 
gggccggcat 

" cacctcaagt 
tctggggctt 
tttcatgcct 
tggcaggtgg 
tatggaccag 
cagaaagaag 
tttacacttg 
tttgtggaga 
gctgaaaaaa 

gggggctggg 

ggagcccaca 
gaaaggctat 
ccagcgggtc 
atgatgggat 
tatttcccta 
aggtgggctg 
tggaggccag 
gctctcagtg 
aatttcacaa 
ctttcctcat 
gcaacccatt 
tggcgaccca 
ctgagatggg 
gggcagacta 
agaggcaggg 
acatcagagt 
tccctctctc 
gggactccgt 
tcctcjtggcc 
attgtgtctt 
• ccaagtgttt 
ggtgttctac 
aacatgggag 
agagagtcca 



tggagagagg 
cagactcgct 
GTGCATCCGT 
GGGTGCAGAA 
GCTCCCCTCC 
TCGTGAAATT 
AAAGgtatgg 
ggctggtcgc 
aggtcttgga 
acaacctgcg 
gctggctgta 
tgatacgtta 
agtggcttag 
ttttgttgat 
agaacagaga 
ctgctttgcc 
gccaagagcc 
cattcatcgt 
atgtccatgc 
tttgcaggag 
aattagggct 
gctaactcct 
gcaccaggga 
tgatgtcaag 
cgtctttaaa 
gagagaggca 
aaactttgat 
atgcagccag 
tggcccagtg 
tctgtttatc 
ccatccttct 
agcccccaat 
aagtggggta 
gctttacctc 
agtctcccca 
tacagatagc 
gcagcctctc 
gaccccacac 
ggaatcatgg 
aggagtgcct 
ctgagaaggt 
gttggcctct 
tgctcagaat 

ggggacctgg 

agggctcaga 
aaccgcatct 
acacagcccc 
ccgatccagg 
cagggccagg 
gaaaacggct 



gagggagagt 
gcagaggcag 
CACTCAGGGT 
GGTGGGCACT 
GCCACCATGA 
TGCCCAGACC 
aaactggcct 
attgcctcgg 
aaaagctggg 
tcctggcttt 
attgcctgga 
tgggcgggca 
cgtgactgga 
aaaagagaga 
cgcttgaggg 
ctggtggctt 
tcctctcata 
tctgctttct 
ctgtcttttc 
taactgtctg 
gcagccccag 
cagggcccag 
aggtgaggcc 
ggagagcagc 
tattcatgct 
ctggaagcag 
gggattttac 
atctcatgac 
tacgttgtgt 
tccttccttg 
gaagtggggc 
gtggagggcc 
aacaggaggt 
ctggagaagg 
ttcaatcctc 
aggacctaag 
atggagggct 
acagatgctc 
gggtcaacag 
ccaaacagtg 

gggaggaagc 

gcaaggaacc 
cccagaaact 
ccttcctgtg 
gaggcagccc 
gtcttctttg 
ggctctgcgc 
tgtttgctaa 
ctgggagcag 
ttctaatgtt" 



ttgccctgca ggctctctgg 
ctgtgctgtt cccggagcct 
TCATTCACCC AGGCAGGCTC 
GTCCCTTCTG GGTGCTGACA 
GCGGCTGCTC CAAAAGATGC 



ATCTTTAAGC 

tgacccttgc 

tgtggtgagc 

gaaattggtg 

gagtaggcgg 

actttggaaa 

gagggataga 

gtttcaagaa 

tttgatgcag 

ggccagtggg 

cagaaaccgg 

gggttccagc 

tggatccctg 

tgcttctcca 

ggcagatcag 

atgggcccac 

tcctgtgtct 

ctcagagggg 

agcatgatgt 

gtctttcagg 

caggcagaat 

aaaatgacaa . 

atggctcagg 

ggcatgtgtg 

ggatgttctg 

actgctcttc 

aggctgcctg 

taatgggtat 

agaaagcaaa 

agagacagct 

a gggtaatgg 

tgtccccacc 

cctggatgcc 

gcatagagaa 

acatctaagc 

cgaagctgcc 

agctggagag 

aagatccatg 

aactgcttca 

atggcaggca 

cttcagctcc 

caggcacgtg 

atatttgtta 

ggtagctgag 

tacacctccc 



TCATCACTGG 

tttctgtctt 

gtgaccattc 

gctgggattc 

gcacccagcc 

tggagttggt 

accctttcca 

gttatccatg 

tgggtfgtga 

aggtggtgat 

atcagctctg 

gtctcgtgct 

tccctccaca 

ggaactgggg 

aaagtgtggc 

agccgggagg 

taaaccccat 

tctggccaac 

ggttccactg 

aagagtcttt 

attcagggaa 

tgagagcaga 

tgcccgccac 

ggtggtggga 

accttcctgt 

caagataccc 

gcagaattcc 

agctgtgatt 

ggggttagga 

ggaaggtttt 

gtgtggggca 

tgcagtctag 

acactgagtg 

gagatgtctg 

tgagatgagg 

agagggagca 

tcagaatgcc 

tgacacagaa . 

gtacagcctt 

cca ggg ca gg 

gcagagagga 
ctaggtgact 
aatagtgagg 
aaaggcctgc 
ctaggtggat 



169716247 
169716297 
169716347 
169716397 
169716447 
169716497 
169716547 
169716597 
169716647 
169716697 
169716747 
169716797 
169716847 
169716897 
169716947 
169716997 
169717047 
169717097 
169717147 
169717197 
169717247 
169717297 
169717347 
169717397 
169717447 
169717497 
169717547 
169717597 
169717647 
169717697 
169717747 
169717797 
169717847 
169717897 
169717947 
169717997 
169718047 
169-718097 
169718147 
169718197 
169718247 
169718297 
169718347 
169718397 
169718447 
169718497 
169718547 
169718597 
169718647 
169718697 



FIG. 1.1 



20040411 93 A2J_> 



WO 2004/041193 PCT7US2003/034681 



2/154 



ttagattcaa 

tttaatttga 

gcagtaatcc 

tgaggaatgt 

atttactgtt 

caggatcacc 

ccatgccaga 

atttctcatc 

cacatatctc 

gtggtctgca 

ctagacttca 

actgtggcca 

ccgtgttaaa 

gcctgagatg 

cctgtcccac 

gcaagctgag 

acgtaaaaca 

cattaagatt 

ccccactccc 

ttctcccaca. 

aaaccacgaa 

ccggaccctg 

ctggcattta 

tggggttgaa 

agtagtgttg 

gagagcccct 

gcaccaggca 

cctatggcct 

ggcagggtct 

ctcactgcag 

tcctaaatag 

gactgtttat 

ctcctggctc 

acaggcatga 

tcccctctgc 

tcacctgatt 

ctcgcctgct 

ggtgaaatca 

agctcccctc 

cagaagttgt 

ttctgcactg 

tgagcagaag 

ccacccatga 

ctatctcaag 

cccaggctgt 

gacacactgt 

gactgacctt 

gagtaccaac 

accaccccaa 

caatggcaca 

tcagctcaca 

cttacatttt 



ccctacatgg 

tttcacttat 

aaccctcagg 

aaagaatgtg 

tcttaccagc 

ctccctgtcc 

aatccttctt 

agcgcttatt 

atcaaccttg 

ttccctttag 

gctccatcat 

gagctactag 

cgaatgcaac 

acccctggta 

aggtgctcag 

tctgcagcct 

ttcaggctgg 

gttcccagaa 

acccctctgg 

tttaaagaat 

tgcagtgagg 

cactgtggct 

gtttcctcat 

tgagattgtg 

atggatatga 

ccgggctggg 

cagattctaa 

tgtcagttac 

cattctgttc 

cctcgacttt 

ctgagaccac 

agagacaggg. 

aagagaccct 

gccaccacgt 

tcaacttgtt 

ccttattagg 

gtttaaccta 

ggggtctggg 

cacccgcctc 
gcaggctcca 
ggaacaaatg 
ggaggaatcc 
ggctgcgggg 
gacaaggccc 
tcctcttttt 
tcctcttgcc 
ggacccgggc 
tctgcctctt 
gccaaggctc 
aggctgtttt 
ggcaaatccc 
gtgccctagg 



atacattatg 

tcattattca 

agagggattt 

ttggaccagg 

cttgcaaata 

tgcctgttcc 

gccatttgtc 

gcagagataa 

aactttcctt 

gaacatgttt 

ggcaagacca 

catggggccc 

ggcaccgcac 

caggagcttt 

caaatgctct 

ccagcctcca 

gtctgagctt 

ggaaaggctg 

ctctattttt 

ctctttggca 

agcacagttg 

cggaagcctc 

ctataaaatg 

cagataaagc 

cttctgttat 

actccctctc 

gagcaacatc 

tgcatgactt 

cccaggctga 

ctgggctcaa 

acatgtgcac 

tctcactatg 

cctgccttgg 

ccagccactg 

ttcttaaggg 

agttaaaagc 

ggagaagccc 

gaacctgcgt 

acagacttcc 

ttctggctgg 

ttaccccagg 

aatccaggag 

gtggggaaaa 

tggagccgta 

tcgaggcaaa 

tcctcccact 

atggactctg 

accagctgtg 

cccactttta 

ttaatgtaat 

aactctgctc 

caccttcctt 



attgcttaat 
ttgcactgag 
ttctttgtcc 
tacctgcttg 
cctgggggac 
taacccttgg 
taagatgcca 
actgctcatg 
cctatccctg 
gctaaaggct 
tgtgcatttt 
agaatggagt 
ggcacagaca 
ctaggaagca 
ctgaagtctg 
gcctgcgtga 
gcatagtgtc 
ggtgggcagg 
atttgtttta 
ttaaaaaaaa 
gtttctagac 
ggactgggtc 
gaattacaat 
tccctgccta 
tagcggatac 
actcattact 
tcatgatcac 
tatttattta 
gtgcagtgat 
gcaatcctcc 
caccacaacc 
ttgcccaggt 
gctcccaaag 
gatgacttta 
cctacccaga 
ctgactccca 
agctcctggc 
cttcctcttt 
tgggggccgc 
ggaggtttaa 
cacagatgac 
agcttctagt 

ggggtgtggg 

actctcccct 
acctcagctg 
gctgttcaag 
ccttcaagca 
tgaagttgtg 
attggagata 
gatgatatga 
ttgcagggca 
gcctctttct 



gatttctgat 
ttctagattt 
cttgcagaac 
ttctatggca 
aaggagggtc 
ccacctcctg 
gctagcccag 
tccacctgct 
atcataattt 
ctcttcccca 
tttgctcacc 
atgtgtgtta 
ccagccgtca 
ctatgcaaca 
tttagctgca 
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acctttccat 
acaaaggatt 
aagcatctcc 
ataaataaag 
tttgattgtt 
aaatggtttt 
catatgccaa 
agagaaaagg 
gttgcccagg 
gcctccaggt 
actgcaggca 
tttagtagag 
tgacctcagg 
agcctgagcc 
ggactcaggc 



cgtacgcctg 
tgcgtgccag 
tgtactttgg 
atgggtagag 
agataagagc 
acgtgggcca 
ttttggagca 
agaaacataa 
aaaagcagtg 
actgtacatc 
catgaccaca 
atctcctttt 
tcctaacttt 
ggccaagtgc 
ttcaggcata 
gcagaagcca 
ccaaatgcca 
gaacacagtg 
ctgcatacca 
cagctcctaa 
gctgaggacc 
atggcttaag 
taatgaggtc 
aactctccag 
catttattta 
ctaataaaaa 
atttataaat 
ctgtacatag 
taacaagcag 
agtggttctt 
tacttgcagt 
tggctatcat 
gctagacgag 
gcagaagctg 
cttacaagca 
agtggggtaa 
ggccattttt 
ttttaattta 
catgacatta 
caattattaa 
ctgcatggaa 
tccttgggat 
tgatattctg 
attgccctgc 
gattttcttt 
gtggagtgca 
atcaagcgat 
ccaccaccat 
acagggtttc 
tgatctgctc 
gttgtgccca 
atccttgcag 



169765547 
169765597 
169765647 
169765697 
169765747 
169765797 
169765847 
169765897 
169765947 
169765997 
169766047 
169766097 
169766147 
169766197 
169766247 
169766297 
169766347 
169766397 
169766447 
169766497 
169766547 
169766597 
169766647 
169766697 
169766747 
169766797 
169766847 
169766897 
169766947 
169766997 
169767047 
169767097 
169767147 
169767197 
169767247 
169767297 
169767347 
169767397 
169767447 
169767497 
169767547 
169767597 
169767647 
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169767747 
169767797 
169767847 
169767897 
169767947 
169767997 
169768047 
169768097 
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ggcctcctgg 
ggaagctgga 
ttagtgtctg 
gattggttca 
agtctcctgg 
gtggtatcac 
ccacagctgt 
atggaaacct 
agagactagg 
catttcagcc 
ttttacttcc 
tgcatttata 
attttctaaa 
agcagtgagt 
ggatccgtgg 
atgaagagta 
tctggagttc 
gtaaatgtgg 
ctaaaatgga 
gaaatggcag 
ggcacaagag 
cagcaggggc 
tttctcctcc 
aggttcacac 
gggtctttgc 
ccaggccttc 
tgggcccaaa 
accccaaata 
caacccagca 
tcttgctcgc 
cagagcagca 
acagcctagg 
acaccacgga 
tgctagcggg 
tctgtgcccc 
ttcccacaac 
agatctaagg 
ggctgatact 
ttaaatgcta 
aatccttgaa 
ttaaagacat 
tggaaagcat 
tctgccttgt 
ccaaccacce 
acagaacaag 
aaacatgtca 
cactacctca 
gcatcttagc 
acaacttact 
cctcaacttt 
agaacagatt 
agataaaact 



cttctcttga 
atgaagcaca 
tggtcattgt 
ggcccctgtg 
accagtgatg 
agacgagctc 
gttaccagcc 
tgtgtgattg 
ggctagaata 
agacacctct 
tcagttcctt 
gttactatgt 
aatatttggc 
tggtgggaga 
gtctggaagt 
agctggagag 
aacacactca 
ccaagttact 
cataataacc 
gtagcaataa 
gtgagtggcc 
cagtaagctt 
cacatcagca 
cctctccctc 
acagagagga 
cgcattcatg 
tctgtgccat 
aaaaccccaa 
ccattcagca 
aagatactca 
tggagccttg 
gtctgtggtg 
ccagctaacc 
ggaagctgca 
atcatgttgc 
taaggtgggt 
atgggcagga 
tttatttcct 
tgtaaagata 
cacaaagact 
gttaatacta 
tgccctacag 
gttgtttctg 
aacacgcttc 
tgcccaatct 
tagcggattt 
tcttcagtgg 
tataggggtg 
ctgctgtgtg 
cccacctgta 
aagggtggta 
caaatgctag 



ttcattttgt 
actgggcaga 
tctctgacag 
atcttcttgt 
gtaacacctg 
ccatcccagg 
cctgctcaat 
gcaacaccaa 
cctctcccct 
tccacttcct 
gtttgtctaa 
tttcctccta 
tctaatataa 
gagattgggg 
taaaagagga 
cttgtgtgta 
gcctcaaatc 
tgactgctct 
catacagggt 
ttacacctac 
tgttcaaggc 
actagccctc 
tccctctctt 
ttaggctcct 
atctgcttcc 
aggtccaagg 
gagtgtcaac 
gagatgtggg 
gctttcaaga 
gcctatgaaa 
ggggtgattt 
cgcatggtgg 
atagacgagt 
cgtgccatgg 
gtaacaagaa 
gcagctgctg 
atttagaaag 
tctatccata 
999aggtttg 
attatcatat 
cagggaactg 
tatfctataaa 
tcatcatttt 
ctgagggcag 
gaaaatgtgt 
atgggtgtaa 
cgtagtgaca 
tccctcaaaa 
acttaagtca 
acctgggcag 
atatgagtgg 
ctgttgtcat 



tttcctggtg 
cccttccttg 
tcagttctgt 
ccatgtgaga 
caccctgagc 
ctggctctca 
catgtttcca 
gatttgtccc 
gcccgtttct 
caaaacttat 
aacaccagct 
gataacaagc 
gggaaattgg 
gtacgtttat 
ctggcccttt 
tagcatttga 
acctacccat 
gtgccttggt 
tattataagg 
tgaatttaaa 
agcatggcct 
atacccagga 
cttcctatgt 
tcctcattcc 
tgcctagagc 
ccaggcagga 
agaagatccc 
ggcatcagct 
ccaagaa'gtc 
tcctcagcta 
tgaggaactg 
gcagcagccc 
taagagtgac 
ttaaggaccc 
agacaaggat 
ggaatgggag 
gtaagggtac 
agtgtaacaa 
aaatttttac 
ttgtgactgt 
catttctata 
tttcagaaca 
acatttgcta 
ggcaaacttg 
cacattcctg 
attatactgt 
gtgaccttca 
agcatgggct 
aatagcttaa 
gataatagtt 
accatttaga 
ttggctgtta 



cctcatgcca 
ggcaaaagta 
ggtacataac 
ctggatgagg 

tggggagcag 

cagtcccttt 

tgtgattctc 

acatccagat 

ccgggcttga 

ctgtctcctc 

aactgataat 

ttcattgaag 

gatcaccatg 

ggaaaatcac 

gagcaaataa 

gggcaagggc 

tattggtcgt 

ttttccatct 

attttgacaa 

aagtggctca 

ggaagacctt 

aagcggttta 

cggccccaga 

gaggcctgga 

ttggagaacc 

ggcaaggcca 

aaagctggag 

tcctcagact 

catactgttc 

atggagtcct 

acactgggaa 

ttgcctgtcc 

agcatgctac 

cagggctggg 

tgcaggtgct 

aaacatttat 

tgtctgcctt . 

aaaaattttt 

caacaatccc 

tgaatatttt 

gaatgcactt 

cattttagga 

gtcttgactt 

gacattattg 

ctcaggttcc 

ttgctcaaaa 

aactatggtg 

ttgttgaagc 

tctctctcag 

tttagcttct 

gtagctcatg 

atgttatctt 



169768147 

169768197 

169768247 

169768297 

169768347 

169768397 

169768447 

169768497 

169768547 

169768597 

169768647 
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169768847 

169768897 

169768947 

169768997 

169769047 
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169769597 
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169770247 
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169770547 

169770597 
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gaaatcccac 
gttagataaa 
ctggcagaga 
tagatagagc 
ttaatcctca 
gctctcacca 
gagccaggtg 
ccaaactagg 
tgctttccat 
cctatccttc 
atgaaggtgg 
aacttgtatt 
gtcactgcgg 
cacagaaaaa 
tgcgtgggta 
gctctggaaa 
aatagcacac 
gccagtgagc 
gaggctctat 
cagcctgatt 
caggatgtga 
gccctcatgc 
actgtcctcc 
gacactcccc 
tgccctcagg 
actcatgtgt 
ccatcccacg 
ataagttcac 
aggcagtctg 
ttgtgtggct 
ggcccagcct 
cagcagggtg 
tggccctgtg 
tcagtgtgcc 
ctcctctggg 
tagatcagtg 
ctaagaggct 
taaaggccaa 
tctaagatcc 
ctgtgagccc 
cctcctgagt 
gggtccccca 
cccacacagg 
aggagatcgc 
cgcagacaca 
cacagcatgg 
caggggcacc 
ggcttttttt 
aataaaggca 
cccaggctga 
ataaatacaa 
ggaggaaggg 



agtagtcaat 
agatggttac 
actgtttatg 
agggccattg 
gggaatctcc 
cgggctgact 
tctgtgattt 
agggggaaaa 
gtgtcagaca 
tcgggcgatg 
tggtttgggg 
ttaaataccc 
cccttgtcaa 
gagcagcaga 
tttgaaggaa 
catttcagtc 
actccccagg 
aaccctcagg 
tctagggacc 
actgatgcac 
acactggtct 
catctcctga 
ttggctggga 
tgggaccacc 
gattagacct 
ccagttgctc 
gctttgctct 
agaggcaggg 
ggttccaatc 
tatggcccgt 
gtgccactct 
tcttatgcca 
tgctgctaag 
tcagtgtctt 
ttgtcatgag 
cctaaagtct 
agctgttagc 
atctgaatcc 
agtgtaacta 
ctgccagcca 
cccccactca 
ctcaggggat 
ggatcgcaga 
agacacagtg 
gtgcaggtgc 
gtgcaccggg 
tgcatgttgt 
tttttctttt 
ttttaaatag 
gctctgcacc 
acaatggccc 
agaaaattaa 



agtggatggc 
tagatgatag 
gaagagttct 
gagggagacc 
atggctcagc 
gaatgagctc 
ccaagagcgt 
acacctcgca 
gagaggaaat 
agagaaagta 
tatttggtat 
cctggcttct 
aaaaacctta 
gtgaaggctc 
gaaggaaggt 
ctggaacacc 
cactgcaaag 
gctgcagagg 
caacaatcag 
ccgagctcac 
ggctgatgct 
agttgaaatc 
gtgggttcat 
tgccttcagg 
ggggtcacaa 
tcagcaagcc 
tgtaacttgg 
ttgtgcagta 
cacccctgcc 
tccagagctc 
actaagccat 
gttctacaaa 
agctgaggaa 
tccctgtaaa 
cattcaaggg 
ggtaaatact 
atttgactgg 
atagtgtttt 
tcaggaatcc 
tcccgtctca 
ggggatcaca 
tgcagacata 
cacagtgaag 
aaggtgtacc 
acctgggtcc 
tcccccacac 
tcttgcaccc 
ttagctcatc 
gccctgtgga 
agaggtcagg 
atgtctctgt 
attataatgg 



attaattgga 
aaggagattc 
tggctcgtaa 
agttgcttga 
cagatcctgg 
attgtcatgt 
gagcagaatc 
gaagcacact 
tgtccaattc 
attggcatag 
ttcttattgg 
tgaagaacgc 
gctgaaaaga 
agtcaggcgg 
gctggaggga 
tggtctctaa 
tgggcaagca 
cgggaggggc 
tgactgttca 
acagacctaa 
caagactgtg 
tcaaacttgg 
aatttacaaa 
gtcttgaggc 
tgtgtgcaat 
aacaagttca 
gaacaaacag 
gtgacacgca 
accaggatcg 
caggccaaga 
ggtccctgga 
gtattcacat 
tttggttgag 
atgggcaaca 
gttaataccc 
caaaaacaag 
catttttaag 
ccagttttat 
tgggcaaagc 
tcagcctgcc 
gacacagtgc 
gcatgggtgc 
gtgtaccagg 
aggtccccca 
cccatgcagg 
aggggatcgc 
tgaaatggaa 
ttctgttttg 
aagggaactg 
gcaggattaa 
gggcaatgtg 
gaaaatataa 



ggagggaatt 

aaattattga 

gagcctcagt 

atgcaagctg 

tctgtaatct 

aatttaatag 

atcctgcacc 

tgtccctagc 

caaatgcatt 

tttgagtgaa 

cataaaggaa 

tttctctgct 

aaaaagaatg 

tgggtctttt 

aagtcaggct 

aggtgccatc 

ggtggggttt 

agtacgggga 

tctccagact 

gaggcagatc 

ccattcctct 

ctgtgtcacc . 

aatccatcag 

ctcaaaaagt 

gtgggtgtga 

agtcacttct 

taccccatga 

caggctcttg 

ccatgtaggg 

tgacaagggg 

attccaccga 

gggcgaggcc 

ttacataccc 

taatgtattt 

agaaagaact 

tagtaaatat 

tgtgttttgc 

atgattttgt 

catcagagtg 

ctgtgcaggg 

agggcctcct 

acctgggtcc 

tcccccactc 

ctcaggagat 

gtattgcaga 

agacatagtg 

taggaaggct 

gaccaattaa 

aagttgtttc 

taaggaagct 

ctctggtttg 

attatttgga 



169770747 
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169773297 



FIG. 1.22 



_2004041193A2_I_> 



WO 2004/041193 



PCT/US2003/034681 



23/154 



tcaggataaa 
ctgtcagaag 
aggaggatgg 
tcctgccttt 
ctgagatgag 
agatggtcac 
accctcccag 
gccctcaagg 
agcgatgcta 
ctccttggta 
ctagggggca 
ggttcaaaga 
tggttcaaac 
gtgcagacaa 
aaagagagag 
tctgtgtcaa 
ctaaaggaaa 
attttttttc 
aaatctatat 
cagagagggt 
agcttcagtc 
aagtaacagt 
gtaggtgagt 
aaggaaggct 
aagattccag 
tactagttcc 
gcatgctggc 
acggttgaga 
tgaccactga 
gccccaccag 
tgcaggcaca 
gagaaaagat 
tgagtccaac 
gcctctggac 
tggccctcca 
aagtgactcc 
acttctggag 
acaaacaaaa 
tgggtggaga 
gagggagctt 
agatgttccc 
atccttatct 
accctattca 
acctgccgcc 
tttatttgtg 
gttgtctgtt 
gcgtagtaag 
gaggaaggaa 
cattagcttg 
gccgacctgc 
ctgggtcttt 
gacactttca 



aaggaagaaa 
agttgtgagg 
tggaacaggg 
tactgttagc 
agatgattgt 
ccacccagtg 
gtactggagg 
agcacccaga 
attgattgca 
gcacttgctc 
aatgtcatta 
tatgagggga 
ccaggggttc 
atgttctgac 
attgtgtgtg 
cctggaaggc 
ggtcttaaat 
ttttttgagc 
ttaataagca 
tgcagacact 
tagctggaga 
gtcaataaac 
ggaggtcaga 
cccagggtag 
aaagggtgca 
cccatgctta 
tgaatgaata 
gatgaggcat 
gcctctaccc 
tcagatgacc 
tctgtgggag 
ggg a 9 aa 9 c c 
atgttagctt 
attcaagtag 
agggggaaag 
tgctatttaa 
atttgcccca 
accagcaaaa 
aaactagaat 
gctttttctc 
tctgcctgaa 
ttgctgaaat 
aattttcttc 
ccaggcaccc 
tatctcctgt 
ttgcacacag 
tgctcagtga 
tgatggtggg 
agcaccagag 
attgcggagg 
cagggccctc 
aactcatcgt 



tcagtggtgt 

ttggctcagt 

ccccacacag 

catgccccac 

atttgtcaga 

aatgtgattt 

tttagaggga 

catatcatgt 

gccttgcatg 

acttttattt 

tgatgcccat 

cttcctcaca 

gtaccttcaa 

agaggacagg 

tacatgtgtg 

ttcttggagg 

gctacactaa 

agaattcaga 

actccaaggt 

taatataaga 

gatgaaacac 

caatgccgag 

a 9 a 99999ca 

atggagagag 

ctttgagggc 

gcagagggcc 

aatgttggac * 

aacaaacgga 

agcccaccaa 

ttccttggct 

aagcaagtag 

ttgggtgaca 

ttttcctgca 

caagaggtaa 

aggaaaaatg 

aacgtgcaaa 

agatcgtgga 

tgcctcctgg 

gcagctttct 

tgctgcctct 

atgcttttcc 

gcccccttct 

tgtagcccca 

agcctcgtga 

ccctctggaa 

gcagtatcct 

agatctgttg 

taaatcggcg 

agcagaccaa 

actgcggggt 

tggctggggc 

gcctggcctt 



ctgaatgaat 
gtaggtgcac 
acgcaggtga 
ctgtggaatt 
ttttctatca 
tagcaccttc 
aacagagcca 
ggccaagatg 
gacctggcca 
catggtacct 
tttacagata 
gccaatatgt 
agaccaagtt 
caggggacag 
cacatgtgta 
tggtgatgtc 
ggaatttgga 
catgattata 
gccagacaat 
tgtagtctct 
agatttattt 
cgccaaataa 
gtctgggctc 
agaggttgga 
tggggcctag 
tgcccatagc 
ccaatgaaaa 
ctgtgggttg 
tacagtttag 
ttccattttt 
actctatagg 
gaaggaggaa 
ggcaggaggc 
atgatttggg 
ctataggaga 
ccctttcatc 
acaagtgtga 
aggtccttgc 
ccaggggtgc 
cacctcaggg 
ctgccaggtc 
cagcaagacc 
ccccctcatt 
ctatctgaca 
tggaaactgc 
cagtgctaaa 
aatgagtgaa 
agtgtgtggc 
ggcaggtcct 
cttggtagat 
caggaggcac 
ggcactcggt 



ctcagtgaat 
aaagatgcct 
aactccagag 
cagtgctatt 
aagtaaatat 
cctgttgcag 
gctctatgtt 
attgcatcca 
ggtgctacag 
cctctccccg 
aggaagctga 
ggcagagtca 
ctcatccact 
agagtgagag 
tggcagtggc 
tagtggaatc 
ttttatcaac 
tccccccaac 
cttctaggtg 
gccctttcct 
agaagataat 
aagtcagtga 
tgggttggtc 
9999ttatgg 
tgtttggcag 
aaLgtgacggt 
gaaaggacaa 
cagggatgtg 
ttgacttgtg 
ttttaggtca 
ggaaaagcag 
gccttgaatg 
agccattgag 
gagattgatc 
gggtttcagg 
cagaaatgtg 
agaaacaaaa 
agtgctgtct 
a 9tgcagagt 
cctctgcacc 
acatggctca 
ttccctgatc 
tacaccacac 
tacccttcat 
agtggcgatt 
gcagcgctgg 
tgaaggaagg 
ctacacagga 
tggttcctgt 
gaagcctggc 
agctgcagtg 
gggctcattt 



169773347 
169773397 
169773447 
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ctggtcataa 

acagtgcaaa 

agtcagaaga 

ggtgtgacct 

gaatggtacc 

agcacggcga 

acttttacac 

tcctggcctg 

tggtctcctc 

ttgacccact 

aaccacaacc 

tatctgctac 

agagcaagtt 

tgtagaaggc 

tggagtgtca 

attctgtaaa 

caggcaaggg 

cttatatcta 

gctttttagg 

tccttacata 

tatagcaagt 

atgagttttg 

agggcaaatg 

tgttctctgt 

acgtggctgc 

aattcaaaat 

tgaagttttt 

ggatccaagt 

aatcattact 

ctaaaaattc 

acctgggctt 

aaggaagctg 

tccaatgccc 

tttcaatgat 

tgttacctta 

tttttctctc 

cccacgagga 

gaggaagccg 

tgtgtttgcc 

cttggatccc 

cattttggat 

tgacatttag 

ccccaccccc 

gtgagaacca 

ctgaacagtt 

atggtccaca 

tgtctccaag 

tctgcccctc 

gattatctct 

ctccagtgga 

agtgacgtct 

gaacactcca 



tgacaccttt 

tgctgcagct 

gatggactca 

tgcacttctc 

agtagctgtg 

gagatttttg 

ataaatatgt 

ccacatggaa 

caccagccca 

cagaagccac 

atttatgaat 

cattcctgag 
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aacctggtag acatcctggg tggctagatc cagaagagag ataacaatca 169820147 
ctacagctcg gctctcagga agccacatcc ctaggaaaag ggggagagta 169820197 
atacatcaag ggaacacccc atgggacaaa agaatctgaa caacagcctc 169820247 
gagccctagg ccttccatct gacagacact acacaaatga ggaggaacca 169820297 
gaaacccaac tctagtaata tgacaaaaca aggttcctta atacccccca 169820347 
aaaaatcaca ctaggtcacc agcaatggat gcaaaccaag aagaaatccc 169820397 
tgtttcacct gaaaaagaat tcagaaggtt agttattaag ctaatcagga 169820447 
aggcaccagt gaaaggcaaa gtccaactta aggaaatcaa aaaaatgata 169820497 
caacaggtga agggagagat tttcaatgaa atagatagca taaataaaaa 169820547 
acaatcaaaa cttcaggaaa caatggacac agttatagaa atgcaaaagg 169820597 
ctctggaaac tctcagcaat agaattgaac acgctgaaga aagaacttca 169820647 
gagctcaaag acaaggtttt agaatgaacc caatccaaaa aagacaaagg 169820697 
aaaaattata agaaaatatg aacaaagcct ccaataagtc tgggattata 169820747 
ttaaatggcc aaacctaaga ataattggca ttcctgagga agaagagaaa 169820797 
tctaaaagtt tggaaaacat attcggggga ataatcaagg aaaactcccc 169820847 
cagccttgct agagccctag acatccaaat acaagaggct caaagaacac 169820897 
ctgggaaatt cattgcaaaa agatcattac ttagacacat tgtcatcagg 169820947 
ttatctaaag ttaagatgaa ggaaagaatc ttaagagctg tgaggcaaaa 169820997 
gcaccaggta atctataaag ggaaacctat cagattaaca gcagatttct 169821047 
cagccaaaac cctacaagct agaagggatt ggggccctat cttcagcctc 169821097 
cttaaacaaa acaattatca gccaagaatt ttgtagccag tgaaactaag 169821147 
ctttataaat gaaggaaaga tacagtcttt ttcagataaa caaatgctga 169821197 
gagaattcac cactaccaag tcagcactac aagaactact aaaaggagct 169821247 
ctaaaccttc aateaaatcc tggaaacaca tcaaaacaga acgtctttaa 169821297 
ggcataaatc tcacagaaac tataaaacaa aaatacaatt aagacaacaa 169821347 
caaaaaacaa ggtatacgga caacaaatag catgaagaat gtaatggtac 169821397 
ctcacacctc aatactaaca ttgaatgtaa atggcctaca tgctccactt 169821447 
aaaagataca gaattgcaga atggctaaga attcaccaac cagctatctg 169821497 
ctgccttcaa gagactcacc taacacataa ggacacaaat aaacttaagg 169821547 
taaaggggtg gaaaagacat tctatgcaaa tggagtaact attcttatat 169821597 
cagacaaaac aaattttaga gcaacagcag ttaaaaaaga caaagaggga 169 821647 
cattatataa tgataaaagg ccttgaccaa caagaaaata tcacaatcct 169 821697 
aaatatatat gcacctaaca ctgaagctcc taaatttata aaacaattac 169821747 
taatagacct aagaaatgag atagacggca acacagtaac agagggggac 169 82179 7 
ttcaatactc cactgacagc actagacagg tcatcaagac agaaagtcaa 169821847 
caaagaacaa tggatttaaa^ gtatatcctg gaacaaatgg acttaacaga 169821897 
tatttataga acattctatc aaacaactgt ggaatataca ttctattcaa 169821947 
ctgcatatgg aactttctcc aagatagacc atatgatagg tcacaaaatg 169821997 
agccttgata aatttaagaa aattgaaatt atatcaagca ctctgttaga 169 822 047 
ccacagtgga ataaaactgg aaattaaggc caaaaggact ttctaaatca 169822097 
tgcaaataca tggaaattaa ataacctgct cctgaatatc actgggtcaa 169822147 
aaatgaaatc aaaatgcaaa tttaaaaatt attcaaactg aatgacagta 169 822197 
ctgacacaac ctatcaaaac ctctgggatc cagcaaaggg gatgctaaga 169 822247 
ggaaagttca tagccctaaa ggcctacatt gaaaagtctg aaagagcaca 169 822297 
aatagataat ctaaagtcac acctcaagga actagagaaa caagaacaaa 169822347 
ccaaacccaa attcagcaga agaaaggaaa taaccaagat cagagcagaa 169 822397 
ctaaatgaaa ttgaaacaaa aaaattacaa aagataaatg aaacaaatag 169 822447 
ctgattcttt gaaaagataa atacgattga tagaccatta gcaagattaa 169 822497 
ccaagatagg aagagagaaa atccaaataa gctcaattag aaatgaaatg 169 822547 
ggagacattg caactgaaaa ccacagaaat acaagagaac attcaaggct 169 822597 
actctgaaca cctttatgtg cataaacttg aaaacctaaa ggagatggat 169 822647 
aaattcctgg atagatacaa ccctccttgc ttaaatcagg gagaattcga 169 822697 
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cctccttccc 
gcttttccaa 
aagggatgtg 
tattgagcac 
gaagaaacaa 
tggagtcaga 
aaagagagag 
ctgaacctta 
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atagaagtat 
tgaaataagc 
tcagagtcat 
aattgacttt 
agatttggaa 
atgaattaga 
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gtttacaatg 
aacatttctc 
tataaataaa 
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tgggtggata 

agaaacctta 
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gccccatttc 
cagaaatgtt 
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tttgggagct 
cctgggcaac 
tgagcatggt 
aggaggatca 
cgccctgcac 
acaaacaaac 
aagaataaag 
tggcctcatg 
atcctctcac 
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ccatcttttc 
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ctctctgtcc 
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ataacaattc 
gacagcagag 
ttgattgagg 
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ggcagctcat 

ttttaaaggt 

tggagactat 
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cagactttcc cagccaggac tcatactatt tttaaacaca tacacctgct 169853947 
tcggtgatgt actgaaggcc tcaggctact ctccccgccc acatctccca 169853997 
cccctctaac aggattcaaa tctccagtgc tcatatttcc tcctcatcac 169854047 
ttacctagca aatctggctc acagtataat atcagtcatt tatacttcag 169854097 
tctgaattcc aggtcagtgc caacttcctc cagatatttg tggtttaagc 169854147 
ttcatgtgat attacgtcat cttaagggca tttagtaaag gggaagaagg 169854197 
aacatttttg gacagaggcc ttttcctgtg ctcatttgtg cctcacagga 169854247 
agccaagagt aaatgtgcat ctagacctac aggggaggtc ttggcagtta 169854297 
cagcaaactc agggaatttt aaatatgccg gaaactttcc agagccctga 169854347 
gttgcagtcc cagctctgtc actgtctctc tgtgctttgc taaacagact 169854397 
tcgcccctcc gtgggccgtc tgtgaaaata gggcccagct cgatggcata 169854447 
taaaggccct ctgacttcca gtagtaaaac tctagaggag cccttgtttt 169854497 
ctggggagac tctttctgag ttttcataat aatgttgagt agttctcttt 169854547 
tcttgctcac gccgactttt ctccagttct cggacaccag ttctgggtac 169854597 
ccaacaattc aatccagttc tgacactaac tatccagagt tagcagaggc 1698 54647 
cccacaggtt aaaggcctcg tcccacaaaa ctgcccccac ttcagacacc 169854697 
agtggccagt ctcagacccc ctgtacttct gaccaaccag ctataagtca 169854747 
aggattccta agactccttc ctcaggtcta ataatttgct aaagaagctc 1698 54797 
acagaactca ...ggaacacact ttacttatgt ttactggttt gttttcaagg 169854847 
atgcaactca gaaacagcca aatgaaatga ctgaatcagg caagatatgg 169854897 
ggttggggga tgaggcatcg agtttctatg ccctctccag gcatgccttt 169854947 
ctcccagcac ttggacgcgt tctccaacag ggaagctctc caaatcttgg 169854997 
tgcttagggg tttttatgga ggttcaatta cataagcatg attgatcaaa 169855047 
ttattggcca atggtgactg actcaatctc cagcctcttc cttctctcca 169855097 
gaagttgggg atggggctga aagtctcaag cttccagtca aggcttggtc 169855147 
tttttggtga ccagcccaat cctgaagcta tcaaaggctc accaagagtc 169855197 
acctcactgg aacaaaagat gctcctatac cctcatcact cgggaaactc 169855247 
caagggtttt aggggctttg tgctaggaat gggagacaga gaccaaatat 169855297 
ctatctattt atttatttat ttatttattt tttgagacgg agtctcactc 169855347 
tgtcacccag gctggagtgc agtggcacaa catctgcctc ccaggttcaa 169855397 
gtgattctcc tgcctcagcc tcccaagtag ctgggactac aggtgcccac 169855447 
caccatgcct ggctaatttt tgtattttta gtagagacgg ggtttcacca 169855497 
tgctggccag gatggtattg attccctgac ctcatgatcc aaccaccttg 169855547 
acctcccaaa gtgctgggat tacaggcgta agccacctcc cccggcccct 169855597 
ttctatgaca ccacagatgg gcacagtcag aatcagagcc tgaggtgcag 169855647 
gtcaaatccc ttagccctcc accagtcccg cccacttggt ttctgtggta 169855697 
gaaagttttg tttactatct gtatctgtat cccccaccaa atgcgagctc 169855747 
ccccatccca ccctgatgct gaactgagga cactcagggc tttgtgtggc 169855797 
tctggcagct tgcacaatgg gttctcagat gaggtattcc atgcccttct 169855847 
tcctggagta gtaagagtca atgcttttca gttcagcaga aattcatgag 169855897 
atattagaaa gcacttccta gatctgcgta gaagttggaa taggttgcat 169855947 
atagaagcta ggaaggttcc atccctggag atctttagga agatcccatt 169855997 
gctctgggtg actcagtcgc tcatttcaat catgaattac aggcattatc 169856047 
gagttaagtc accatggtgg ctggacctgc . tttctccaaa gtccctccag 169856097 
ctgcaggttg gtgagcccat agatggggtt agtgaagcat tgttggtcag 169856147 
ctcaccaggg cacccttatg agagtgaagt agaccgtgac tcctctgcag 169856197 
ttggttaagg aagaggacac tgaacaaagt catgtgttaa gagtggtgtt 1698-56247 
tggctggcaa attctgattc atctttcagg accctgctca agcatctcct 169856297 
cttttgagaa gcctggcctg gccctctccc catgccccac tcttccttcc 169856347 
ccaggtttaa ttaaatgtct cttctcagct ggtgtgacct cttggttgat 169856397 
tgcaaatagt gcatactcca ctctctctcc acggaactgt gagctttcta 169856447 
agggccagaa ccatacccac ctcatctctc actgcc.tccc tggtgccatc . 169 85 6497 
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aagacaccca gggtggctta actttctcct cagcaaagag ctctcctctg 169864347 
ttccctgaat cctggatatc ccactgggtc ctctagtgac cccaagcttc 169864397 
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AGGAGACCAC GACAGAGGCC TGGAACCGAA GTTCTAATCC CACATCCTGG 1698 64647 
CTGGGCAACT TCAGGCAAAT TTCTAACACA AGgtaagcct caatttctct 169864697 
ctggggtaat gatcaggcac ctgcttaatt cacaggggtt tggtgggcat 1698 64747 
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tgtgtacaat agcaatgtct gtgcggactt aaggacctgc gcccatatgt 169865197 

ttgtgggact ttctgggcat gcatgcttgt ttatgaggcc atacatccgg 1698 65247 
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accactgcac tccagcctgg gcaacagagc gagactccgt ctcaaaaaaa 169865947 
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ggctcctttc tcctccccaa ctccctgtca ctagaaggtg ggccctgcca 169866047 

taggaggatt ctgcagaacc ctcaaggacc cgcggagcag gacggcacct 169866097 
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aagatgaatg attgaatgaa tgggggaagc agtaaaggag tgaatgaatg 1698 72147 
aagcaacaaa tgaattatca actgaatgac agaaagaatg atcgaataaa 169872197 
tgaccacatg aatgaatgaa tatgtccata aatgaatgaa tgatgaagtg 1698 72247 
ccagaacagc tataattaac agcctatctt ttcacaaaga gcattgatag 169872297 
tcatctcccc aaagttgaaa aatataaatt cagtcaagaa gagtcctggt 169872347 
aattttttgg atactggtag gtgtatagtg tctttggaga aggtgaggag 1698 723 97 
atgtctctaa gttttcactc ttttgctgaa acaaaagcag ggactgactc 169872447 
acagtgattc cccttgtctt cccttatatt tcaggttcgt tcacccttgc 169872497 
ctgggtgaga tcagaatctg tatatttatc cactcattta tttgttcact 169872547 
tagccagtcc cacaattgtc cttcagcatt gcttccctgc ctaagatcca 169872597 
ctctggcttc ctcatcctcc ctcttcctcc atctcctcat tcacctggcc 169872647 
aactcctact tatctttcaa ggcctcattc aatgccaccg ctctgggaag 169872697 
cctttcctgg cttctgtagg cagagcacat ctctgccttc tctctttccc 169872747 
catagctgct gctgcttccc ttaactacaa ttctgaacac actatgaggt 1698 72797 
tgtcattatg ggtttacaca tttatctctc tgaotagctt tcaggcactg 169872847 
agtcttatcc cagggcctct gtcaggcctg acactgcatc caggcttgat 169872897 
ggatgctggc ggcatggagg taaggatgga tgggccagag tgtgcgttct 169872947 
tgatggctct gagcggtgtg agaatgcggc ggtgcagatg aggaatccag 169872997 
gtctgctcct ggggagaagc tctgcaaaat tctgcataat ggccgtggag 1698 73 047 
ctgatgttcc cattcatcag gcagggaagg aacagttttt agaaaagaga 1698 73 097 
gagagagcct aagagtccta gggccttcag agaaacatct ctttaaatgc 169873147 
ttccccttcc tccgggtagt attttaaata ggtcaccttc tggaagtagt 169873197 
ggtagtcagt tgggatattg ctagcttccc gtaccccagc catgggagta 169873247 
gtactttgga ttaaatcttg ttattcaagg ccagactaaa agcctcattg 169873297 
gtgcggtggg gggaagtagg aaccacattc cactcaaatc aggaagctag 1698 73347 
aagtaggatt gcattatggc tatgagcacc gcgttaaaat aagagtcttg 169873397 
ggattgaact ccctttcatt ggttcatagc tgagcctttc caggcctcag 169873447 
tttcctcttc tataaaatgg ggacagtggg ccaggtatgg tggctcacat 169873497 
ctgtaatccc agcagtttgg aaggtcaagg tggacagatc acgaggtcag 169873547 
gagttcaaga ccagcctggc caatatggt'g aaaccccatc tctactaaaa 1698 73 597 
atacaaaaat tagccgggca tggtggcatg ctcctgtaat cccagtcact 169873647 
caggaggctg aggcaggaga atcgcttgaa ccgaggagac ggaggttgca 169873 697 
gtgagccaag actgcaccat tgcactccag cctggatgac agagcgagac 169873747 
tccgtctcaa aaaaaataaa aattaaaaaa tggggacaat gacagtgtct 169873797 
ccctataggg ccacgtgagg gccaagaagg aattgaacat ggtgctgcac 169873847 
accaggtgag ttggcagtgc tggctttgct gctgcctttt gtcccatctc 169873897 
aggacaggcc ccttccttct gccttgctcc acttggatga tgacggtgtt 1698 73 947 
acccttggta atctccccat ccctttgagc tgtggcctct gattccagcc 169873997 
ctctgagatc ccgagccacc atatccatgg tcactggcat tctaagcagt 169874047 
ttctctggtg attcctaagg gaagctgtca actttagatg gtaaaagcaa 169874097 
ttcccaacag tgaggtatga aaatgcaggt cacccgccag gaaggataat 1698 74147 
aatactagct caaatgactc tggcctccac ccagtgccag gcaccttgct 1698 74197 
cggctctttc ctgtgttatt ttattaaccc caaccctgga ggacaagatg 1698 74247 
ctattacagt tctcattttg cagttgagga acctgaggct ctaaaaaggg 169874297 
aagtgcctta ctaagggtca tagagcttgg gaatcgcaga gcctggttct 169874347 
ggttgatccc agaccctgag cttctaactg ccatactaac ctcctctcct 169874397 
cctggctttg cctctggaac cttagggacc tgagtagaaa aacagtcaaa 1698 74447 
acgggtgcat ggatgttggg gagaaggcat tcaaggatag actgaagcac 169874497 
ctcattagcc acacggaaca actgagctac gtgcaggccc tgagcaaaat 169874547 
gcttttagcc ccatcccaac cccgcctccc cctagatctg tttccatttc 169874597 
atctgacact gtggaagcct gaatatgaaa tgattggggg cagaaggcca 1698 74647 
attagcaaat tcattaatag gattaaacat ctctcctctg ctctaatcat 169874697 
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agcatgaaga 
gcactctatg 
tgcgactcta 
gtgcatgtgt 
acactgtgct 
tcctatgaag 
gaagctcaga 
agtgcaggga 
gagattctac 
attaagcccc 
aatggtagac 
ggggcctcgc 
tccttctctt 
cattataaac 
caaaaacatt 
aaaccctcct 
gccggactct 
ttccttcagg 
• cgggcagtgc 
; tggataagtc 
v agagctaatg 
atatgtgaaa 
. tgggttgtta 
tacaggttcc 
tgcctggcat 
atccctcatt 
tctttcttaa 
acatggaaga 
...tgttcactgc 
cacctgcttt 
aactcctctt 
gtagaaatca 
ctggggatct 
tcaaagttag 
gccaccttta 
tctttccaaa 
ggtaggccta 
aacaaataaa 
tcattcatat 
tcagggacct 
cctctacatc 
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cctaaggcat 
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gatcacagcc 
cagctgggtg 

tggtcatctt 
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tgccagcaaa 
gggaattcgc 
agcagcctcc 
cagcccaatg 
gtttccagct 
ggaagagtac 
gagttttccc 
gaacaaagga 
aatgagtctg 
gagaccccag 
gatttttttc 
atgctggtgg 
aagtggtgtg 
tctgacaggt 
tcctgatctt 
gctgaactgc 
gctttctgcc 
tgctggagaa 
ctgcagccag 
cattcctcct 
taaagctaag 
cttacaggag 
cctgcacgtg 
atatgcggat 
gctgggcatc 
agccattctc 
agccagacat 
aacctgctag 
gtcagttcta 
accccccagt 
ggctttcttg 
ggaccagcct 
tacacattcc 
ccaacttggc 
gctccagcac 
tgccacatgc 
. catacatgtg 
agctacactg 
gacccaggac 
aatgtagaaa 
gattcaatga 
tggccatctt 
tgtgaaatcc 
attcagttta 
cctgagagag 
gttcttggtt 
ctgggtggtc 
ggattcctcc 
gaagaaaagt 
gacatttgac 
aaactttgcc 
actgggtgcc 



atcaggcttc 
caaggtaaat 
ccagggcacc 
gcatttggga 
gcactgggac 
agcaggaagg 
tctaaagagg 
atcttggcgc 
acactcgcca 
gcaaccaagt 
tctctaggtc 
gaacaggtcg 
ggcagctccc 
gcccaaggta 
gcgtgctgca 
taggcgatgc 
aggcataatt 
tcctgggctg 
tggctgattc 
cctaagactc 
ggaggcgggg 
ggggtcccca 
tcacctccct 
gacaccaagt 
tccctcacgt 
ctcctacccc 
acctaacctc 
tcctttccca 
cagcagatgt 
cctccggcac 
tggtggcctc 
cccactgcct 
caaggttgga 
cccttctgac 
tgcatcccct 
acttctccca 
tggaagggag 
tgtgggtttc 
aagtcacttg 
taatagtaat 
catattcatg 
atcaagtgcc 
aagtgtgggt 
aactagttca 
gagggacctt 
ctgaggcccg 
aagttccctg 
ccaccaaacc 
gtgcaactca 
ggtggcagcg 
taagatttag 
tgctctgaga 



caagccccca 
gccaccttgt 
attcctgtct 
gccacatcca 
attctggcag 



agaggaagca 
acctcctgag 
tttgatgatc 
gtgccttagt 
aaactgatta 
ataaataggg 
agatgggctg 
cctgccccac 
gagttcagtt 
gagagctgtc 
tctgggaagc 
attgcatgga 
gtgccagagt 
tcagagtcga 
ctacctggct 
agttgatgag 
gagttctgac 
gggacaatcc 
gtctttccag 
gtctcctcca 
agtctgcctc 
ccagacctcc 
ctccctgtgc 
gtcttggctc 
agcccaggtc 
ctccctggcc 
ccagatccaa 
gtccttccat 
tgtcctgaag 
gccctgcttg 
gttgtccaat 
gaatgcaagg 
aatcttagct 
gcccctctgt 
aacagaaccc 
taaaagactc 
tgctgcctgt 
ggtggagggg 
tgccttcatt 
acccatggcc 
gctgccctgg 
agttctttga 
cctgactctg 
gctcagcttt 
ttaggagggg 
gccctgggct 
Qcaggatacg 



aggctcactg 
gaggtctttg 
cagcatgagg 
caccccacgg 
attctccctg 
gaaaaaaatt 
ccacttctga 
gtgtctgcag 
aggaattata 
gctactgctc 
ctgtgggaga 
tacattcagg 
cctgccaccg 
ccctcggcca 
ctcagagcag 
cctc'cgccag 
taattacacg 
agaaaggcag 
ctttgaaaga 
gggccaggac 
ttctctcctc 
ctaatccctc 
acccctgccc 
ccaggcatct 
gcccattgca 
tcagtccata 
actaggaacc 
accatccgaa 
ctgcctgttc 
tgggctccat 
tccccaccct 
tgtcttccag 
ggtttccttc 
acccttccca 
tgacacccaa 
gtggctttgc 
gtgaatgttc 
ctgtcacttg 
tattctttct 
atccatgggg 
agagcatgac 
cccaagatgg 
tggtatgggg 
ttacagaggg 
accccaggac 
catggagcag 
gggctgacta 
agaagtgggt 
ccagggtaaa 
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cagcctgatg 
tggaaagaac 
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attcagagcc 
ggttccggaa 
agagggtaat 
atggcaaacg 
cacccactcc 
ttctttactc 
attagcacat 
gcaaatgcat 
ctccagcctg 
ttctcttccc 
caggagtcct 
gccaggtcat 
tcccagcacc 
ggtctgccct 
catcccctgg 
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cggggcggga 
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gagccagctg 

tgtctctgca 

tgcctgggtc 

accttcctga 

acctgcccca 

tttaggtagt 

gcaagaacca 

tccctttgcc 

aaccctacat 

ggagcctgtg 

ttgcagtgcc 

acaggcagga 

cagagccatg 

gactctgtct 

gtcatgttag 

ccctgtgcag 

gatgccagca 

tacaacctcc 

gagcctcaca 

tcgcctgtgg 

acctggccat 

taaagttcaa 

tgcttagaac 

gttgtcaata 

ggaggctaca 

cctgtcgggt 

actgccatta 

cacccagatt 

gagatttgct 

tgggaggaga 

tgctcagcct 

taccttttat 

cgcgctctgt 

caacctccac 

tagctgggat 

ttagtagaca 

gacctcaggt 

gtgtgagcca 

gacctggtac 

catcacaagg 

gctcctaagc 

agagtccaag 

aattaaacta 

gagacttgag 

gagacagggt 

atcaggaaga 

attaaaaagg 

tgcgaacggc 

atcagccact 

ccttttaagc 

agtgattttg 

gtgggagagt 



tcctgggctt 

ggcctctagc 

ctcaccccta 

gcctcagttt 

tagagttgtg 

gcctgagatt 

agcactatca 

ttccctccct 

acatctttgc 

acctttaccg 

agccaaacat 

gtgttctgga 

tgatgctgca 

ctgacatctc 

ctgcgcctcc 

ggcttctgtg 

ccaaccaggc 

cttccaatat 

tcgctggacc 

ctgggattga 

gctcatgtca 

cctacatgct 

agtgcctggc 

tcaactctac 

ggaagggaag 

tccagtctgg 

gagggactca 

acatccatct 

ggacttgggg 

agactgaggt 

gtccccactg 

ctgattagct 

tgcccaggct 

ctaccaggtt 

tacaggtgcc 

tgggctttca 

gatctgccca 

ccgcacccgg 

aggcatcacc 

ctggcttggg 

agccccagca 

ccttatagag 

tgtatttgtt 

tcaactaagc 

actggcccag 

cctgggtgag 

tcactttccc 

aatctattgt 

cacagctgca 

tagtcacgaa 

cttaggtaca 

cactgagctg 



tggaaatagc 

agaggggcta 

ctcccgcact 

tctcttctgt 

tggataagcg 

tagcaaaggc 

tgcaggtagg 

agaatgcaga 

caattggcag 

ccgtgtgatt 

ggtgacaagc 

atcttctcct 

ctgtccacct 

tattagccaa 

caccctgcat 

tcagagtcac 

tgctgtcctg 

tcaccatgtg 

ccactgtgtg 

gacctggctc 

ccctcctggg 

gtgactgtga 

tcctactaca 

gcagtagccc 

gcatgtgcct 

gcttttctgc 

ttcattccca 

ttaattgaag 

agtcaccaat 

gtcatcagct 

atgggacacc 

tttttttttt 

ggaatgcagt 

ccagtgattc 

tgccaccaca 

ccatgttggc 

cttctgcctc 

cctgattagt 

ttctctggga 

tgacctttct 

tggagctcca 

cttccagggt 

tatttgctaa 

aaataaatgt 

tagaaaagta 

aattctccag 

tctcaaagcc 

gaggacctgg 

gcagaaccag 

taatcagaag 

aacttgaatg 

gacgttagca 



tgtgtccgta 
agagcatgcg 
tagtagccct 
aaatggaaga 
atagaatctg 
tcaatcaaca 
gcacccatct 
ggaaggaaga 
tttttcctct 
ctctgactca 
cccttcccag 
ggagtgggag 
ggttgaagga 
gggtgagggt 
cctgggccca 
cagggaaatg 
gggcgacatg 
ccagctccag 
ctcagggtca 
ctccaccttg 
caccccaata 
agtgagaaag 
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atttctgtag 

ggggtggatt 

tctttccagg 
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acctatcctt 
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gcttcttatc 
tttgcccaac 
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ccaaagttct 
tgcttcttcc 
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cccctgcctt 

ctctcaccca 

gagacagaat 

cagctcaggg 

gcctcccaag 

ttttgtattt 

ttgaattcct 

gggattacag 

tgtcctttgg 

gtccctccct 

aaactccctt 

ctctaatatc 
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tattacaaaa 
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ggctcacccc 
ccttgggttt 
atcatgatac 
aaattaatag 
caattctgaa 
tgtgggacat 
gagccttagg 
tcaaatgagt 
agccaatccc 
ccctgctccc 
ctgcagcccc 
agctggataa 
aacaccatcg 
gtgtaatttg 
tgcatggctg 
gcatagcaca 
atacttggaa 
ggcatatatg 
tttactcccc 
attctgggac 
taaggttcta 
ctttaagggg 
ttccagatgg 
cattcaactg 
cctaacccca 
gagcatccca 
aaccaactct 
taataaaatg 
gctcaaagaa 
gctctcttct 
tccaagagta 
acctctgtgg 
ggctcccatt 
tctggggacg 
tggggaggtg 
gccagcagaa 
gtaaagccta 
aaggattatg 
taggactgtg 
ttctgttatc 
ggtggagtaa 
gagtttggac 
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aggcctttct 
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cccatgtgag agctcacata tggttacaaa ttcttgaaga aaatgaggct 169911147 
ccttccttcc ccatccacct tcctagtacc aaagtcagga cagacaggtt 169911197 
ttctgtgtta gttcacttct cattcacctt tgtattaaag gtctagctct 169911247 
ttgaggttca agttttatgt tgggacctct tcccacttgg tagggcctgg 169911297 
gttttaccta ctgggtttct aagcatattt tgtggttaac atcagaggcc 169911347 
aggagactca tcagtgtata attaattctt. gtataactaa ttatttaaag 169911397 
gcagtggacc tattcactag caccctgctt tcggagtcac. tgaattcatg 169911447 
tttgccctca gctaaagcca tcctgaacct ctttagatgg gagaaccctg 169911497 
atgctggtac ctgtagcatg ggggctgggg ctggagaagc caagtagtac 169911547 
actagggacc tcacggtcac tgtgagggat gggggtctct gcttggcttc 169911597 

tgaacaccca aaagtaagct cttctgcctt cactccatgt ccactggagc 169911647 

ctgggatcct ctcataagaa gctatattta ctctaatgaa aaaaaatctc 169911697 

acctgggctg aaaaagcgta tgctatacat gagccttggt taacctaaac 169911747 

tttcatccag agtttcaagg attaaattcc taagttcttt ttaaagcaaa 169911797 

gagaagttag ttccagtcct aggccatgat aatgctcttg gctgagaaaa 169911847 

cctccttcgt ttttttaaaa tatggattct gagggtgcaa gtgcagtttc 169911897 

gttatatgga agtattgcac agtgatgaag ccatggcttt tagtgtaccc 169911947 

atcatccata tagtgtacat tgtacecaac aggtaajttca ccatccctca 169911997 

acccccctcc tctgttttaa ggaactggca tttatgcaac taattgataa 169912047 

aattatcatc tgtctcttaa ccagattaat agtataatta aggaaatctg 169912097 

ggatgatgat agcaggttta ctgttgatga aaccattgaa cacttgattt 169912147 

aacccattaa cagtgtttct gaaagtatgt tccttgaaac actagaccat 169912197 

gaaaagtcct ttatgttgaa atatgcttaa gaaatattat atgcttgaaa 169912247 

cttctcttgg aaactcacga agtatattag acattaaagg ctctaagaaa 169912297 

tcctgcaata aagaagccta tttaatttta tttgactcgg tgttttcaaa 169912347 

ttatgtgtga ccacagaaat agaaggaact tggaggagta ataattcatt 1699123 97 

ttttgggtag aattcactac tgtcacagat atttgtatgc tgtcaatgaa 169912447 

catttaaatg tggattatag agaaagtaag atttattcaa caaatatttg 169912497 

ctgaaagtcc actatgtgct taccactttg agtatacatc ggtgaataaa 169912547 

aaagatatag ttctgtctca tggagcttgt aggagcattg gactgtcgtg 1699125 97 

gaggcattaa gaaagaagta gcaagtgcct agaatccagc tgcccctcaa 169912647 

attgttggat cacagccgtg gaacaggccc cagagctggg aagttgactg 169912697 

ggcagttggc tgttgtatcc agcctgggag atgcaaaatt tattgacata 169912747 

actgtacccc tcaagaggac cataaagaag accaagcaga aacacacttg 169912797 

gfctctggaac agaaaaaagt ttatctagga aaggaagttt aggaaagact 169912847 

gttagataag aaactgggtg gtttttaagg cttgcagaat ctcttattga 169912897 

tctgagaatt cctcccaaaa caatccctct aaaagtgcct gattcataat 169912947 

ccagtgaatg ttaacaggaa tgacgatgat gattatgatg gtgatgatga 169912997 

tggcaatgat gatgatggtg atgatgatgg tgatgacaat gatggtgatg 169913047 

atgatggtga tgatgatggc aatgatgatg atggtgataa tgatgatggt 169913 097 

gatgacaatg atggtgatga tgatggtgat gatgatggca atgatgatga 169913147 

tgatggtgat gacaatgatg gtgatgatga tggtgatgac aatgatggta 169913197 

acagtgatga tgacaatgac aatactggtg atgatgatgg tgatgacaat 169913247 

gatggtaaca gtgatgatga taatgacagt actgatgatg atgatgatgg 169913297 

tgatgatgat gacaataatg gtgatgacaa ggttgatgat aatggtggtg 169913347 

atggtgatat tggtgatggc aatgatgata gagtttaatc tgcaactcct 1699133 97 

tcccttcact caatgctatg gccaattggc caccaagaac tgtgagtcta 169913447 

cttcataagt aactaactct taagctcatc ctttaatttc tttggaacag 169913497 

ggcttaacct ttctcaacag ccccctcatc tcatttacct tctttcagtg 169913547 

tgtgttcttc caatctgtct ccacactgtg ccaatgtaag cttcctaaag 169913597 

tttaagtctg ggtacaggcg cactcactca ctccgtgcac ttttacactt 169913 647 

ttatgccttt tttgtgttct tctctacttg tgttttctcc ttcttgtctt . 169913697 
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gagggtgata gcaaggaaga atgcatctga gatgccaggc tcaggagatt 169918947 
cctggagtcc tgcactgtgt gccctttggc ccagaaggcc cttgtaagca 169918997 
cctttaggtg ttgtctgcat ttcccttccc tttctggtca gggaccccag 169919047 
ctctctctgc atggattctc ctctccagga ggtggggggt gggagccgat 169919097 
gtcaacaggg aagaggggca gaggaacaag gacagggatg gggagaggaa 169919147 
gaaagaagga ggtgggatgt ggcagctggg cccggaagca ggctagagcc 169919197 
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agaccattta 
ctgcaatttc 
ttaatttttt 
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ggaagccccg 
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ggatatttca 
ttctttctct 



aggccctttg 
. cacaagtcat 
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acacctctgg 
cttgagagta 
cggatttact 
tgggagccag 
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gcctctgctc 
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tattcactaa 
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tttaaaggca 
cagcaccatg 



ataaggacca 
agcagggaag 
agctgaattg 
cggggaggag 
cagagccctg 
ggttccatgt 
ggacacagtg 
aagagacaga 
acaggggtct 
ggagggtagg 
tgtcccagca 
cctgccacac 
gctgaggaca 
gaaagcatct 
gacagggaat 
ggggtagatt 
aaggagtctc 
ctgttgaggc 
caagttcccc 
aggggctttg 
gatgggaagg 
gtcctcgcta 
aatggacttg 
gggtgcactg 
cagcggtgta 
acagaagaaa 
ggatccttgg 
ttgcagggag 
taaatgtggg 
agaaaattga 
ggagagcctt 
tcagggcatt 
ttgccccacc 
caggagagga 
aagatctggc 
cctgtgctgc 
cctctttccc 
acctgcagga 
aacttgggtt 
cacctcttag 
ctgcccacca 
catgggtgct 
tgggaacccc 
ctgcttcctt 
agggaaagga 
aaatacacat 
ttcagtggca 
tggttccaga 
gaagccactc 
gctttctgtt 
tcgtgcaata 
ttttagggtc 



aatccctttc 
taatttacag 
ctgcaaagag 
gggctgaagg 
agctcagact 
tgagctcctg 
acaaacgaga 
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gacgcagtgg 
cagggccaca 
tctcagatca 
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tgatttttgg 
ctccctcatt 
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gctcatctga 
tctggcccta 
attttcttgc 
ttggatatac 
aacataaaat 
ttaattacat 
actttctcat 
cccattccca 
tctgtggatt 
tgtggccttt 
tatctacgtt 



cgttgcttgg 

gaattcaaag 

gaacctcaat 

aaaagcttcc 

cagcgatcgt 

ctgtgtggca 

cagatgccaa 

gggctacaca 

cttggggcag 

tgaccagcta 

caagcaaagg 

ccatgtggtt 

agatccggca 

ctacagatga 

tttcaggaaa 

caggaggtgg 

gtgaagtttc 

aaaacttgaa 

acggcccggt 

acttgcccaa 

agggagggtt 

ttcggttgcc . 

ggttcatgac 

agcccactct 

agtagtcctt 

atgccccact 

ttgtgccctt 

atctctgtga 

cccaagaatc 

cctgctcact 

ctgacagcag 

gtcccacccc 

gaactgcacc 

tgcgtgtcca 

caacaaaaaa 

agcagacttg 

caggaaagga 

cagcagtgaa 

ctctgtgtca 

tcagaatggg 

caccctccct 

gtccctctgt 

ctctgcttca 

tatttcccag 

tcagcccatc 

gtactatttt 

tcacaacggc 

cacctcagat 

cctcccccaa 

tgcctattct 

tgtgcctggc 

gtagcatggg 
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tcagtacttc 
gtatccctct 
tgtaacctct 
tgtagctggg 
ttttagtagc 
ctgacctcag 
aggcgtgagc 
acattccatt 
tgaacctttg 
aaatattcat 
ggtatatacc 
aacttattga 
acgatccctc 
aatgttcttg 
ttttaatcac 



actgatagcc 
aggagcctgg 
catcttccag 
cgtgagacct 
acacttacta 
ccacgctcac 
tttgcagaac 
gctctggttc 
tcccaccagg 
caggcttctg 
cctgagagtt 
. gcctcctgtt 
cagactagac 
cccaagggtg 
ttgtcactgt 
ctcaggggtg 
gatgaagagt 
acagtaagtg 
tttacagagt 
tcctacttct 
. gggataatgt 
acagagcctg 
attccaatca 
cctgcctcct 
agactattca 
aggtcacaga* 
agggattccg 
atgtgtgacg 
atctacatgg 
ttgtttccca 
aacaatagac 
caaggagtga 
tgtgacacac 
cctaggaaca 
ctgccagcca 
ctgatagaag 
ggggcagcag 



aaccctgttc 
gttgctcagg 
gccccctggg 
attacaggca 
gacggggttt 
gtgatctgcc 
caccacgcct 
gtgtggatat 
gattgtctca 
gtacaagttt 
caggagtgga 
gaaactgccc 
aagcaaggtg 
tttttatggt 
caggcagaga 
tttgtgatca 
agtccctgac 
ggctgtggac 
gtctagtgca 
aaattcttaa 
agccgtccct 
ctcatgatga 
ccctccaccg 
cagcgcccta 
tcccctggcc 
cctttcccca 
gaggctagag 
aggacggctc 
tggctctggg 
cactttcagc 
tagtgggatg 
aaatgagcta 
ctcagtttat 
gatgaccacc 
acagtgaccc 
gacgaggtgc 
aggtgaaccc 
tggcaaagaa 
cagtggctgt 
cactgaaagt 
gaggtttgtg 
gagggcagcg 
agaaagtttt 
cttgccaata 
tgactgctac 
atttattctc 



ggacagaaga 
acacacccca 
aggacaccac 
tcctcagagc 
agcaggaagc 
atcttttggg 



tttttttttt 
ctgaagtgca 
ttcaagtcat 
cgtaccacca 
tgccatgttg 
cacctcggcc 
ggccacttca 
accacacggt 
cattttgacc 
ttatttgaac 
atcgttggct 
aacagttttc 
caagggtatc 
ccttccagtg 
tctctcaacc 
atcagactgg 
ccagaaggtg 
tgggcagttt 
catgtgcgac 
ccattacccc 
gcagtgtccc 
atgcagcatc 
aagcactact 
ggctggggcc 
tgagatcaga 
ggatgtgttc 
gaaggaccac 
cgtctgtcgc 
atcagcctgc 
tcgttattta 
atgaaaacaa 
atcccagtaa 
ctaagcatca 
gtatgctcca 
ttagaggtga 
tgagaggtca 
caaatctgtc 
aatgcagtgt 
ttcaaggagg 
gacaatggag 
gaggtgagat 
atccctaagg 
ccctcttctc 
ttcttggaga 
aacaaattac 
tcactgccct 
aggtaggcac 
catccttctc 
tacattcatt 
cgaaaagcaa 
acagttaaag 
cttcatgtca 



tttttttttt 
gtggcacgac 
tctcctgcct 
cacctggcta 
gccaggctgg 
tctcaaagtg 
tcccttttta 
tttgtttatt 
attgtgaatc 
acttgtctct 
catatggtaa 
cataacagct 
agtttcccca 
ggtgtgaggc 
aactataaaa 
ctggagaaca 
gccatctcaa 
gggacccggg 
ctcatccgat 
accagtctat 
aggcccactc 
cactgtttcc 
ttgtccaaga 
aaactagcca 
ttcttagctg 
tcaagcgggc 
ctcctgttca 
gcttctgttg 
cctggttcgg 
gcctccagtg 
ttcccgtttc 
aaagctttaa 
ttatcaaaca 
aacccagggc 
gcagtctagc 
tgatctccca 
tgactctaaa 
cttctcacga 
gccctttggt 
aaactggcca 
gagggtctca 
tggagccttc 
ccctcacagc 
atgaatttgg 
cacaaactga 
gaaggctaga 
tagaacttga 
cagcccccca 
ccagaaaaga 
acctttgctg 
gggaacaggc 
tagtattgta 



ttgagatgga 
ctcagctccc . 
cagcctccca 
agttttgtat 
ttttgaa-ctc 
gtgggattac 
tggctgagta 
catcagttga 
atgctgctat 
aattctttgg 
ctctgtgttt 
gcaccattgt 
catcctcatc 
tctttttact 
tgattaaatg 
ggaagatgag 
atgttttttc 
aggggcaggc 
cctcatagta 
gtccacacgg 
tcacttgttc 
ttcttctgct 
aactgctcct 
gtatcaacca 
cacagctgcc 
aatgcctcaa 
ccagcatgca 
gtctctcatg 
atctacttaa 
cttccatttt 
gtagagtagt 
ctcctgagcc 
taaacaatca 
tgggggcctt 
ccccacttta 
gcagtgagtg 
gttcatgctc 
agggttcatc 
ttgcccagat 
ggacactcca 
gccggcgggg 
cagccacagg 
ccgcaacaaa 
ggggctgtat 
gtgggtttaa 
aggtccagat 
cccagaagcc 
ttatctgtct 
ccccttttgt 
ctgagagctg 
tcccaactct 
aagacaattg 
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ccaaagtcag 
tctgagcctc 
gactgactcg 
gtggctcacg 
catctgaggt 
gtctctacta 
aatcccagcg 
ggcagaggtt 
aaactgtctc 
ggaaaaccct 
cccttactct 
tgggaagcac 
tcgggctctg 
ctccaacagg 
ggcctcctgg 
aattctcctg 
ttccccaggg. 
tcctttcagt 
ctatttattt 
attgagttgt 
atcgcaagca 
atgaggaaac 
aacatgagga 
tttgtgcaat 
ctagtttggg 
gaggctaaag 
ggcaacatag 
caaaaaatta 
ggagactgag 
gagctataat 
tgtctctaaa 
tttaagtcag 
gctggacata 
tcctaatttg 
gatctgttag 
gaccagcagt 
ccctcgaatc 
ggtgatgatg 
tacacaccag 
agtgagctat 
taataggcct 
ctcaaggcac 
cccagacaca 
gtagtgaaag 
ctggagccag 
tgagtgacct 
tcatgtataa 
aaaaattcaa 



ctaatgagtc 
ttcctgtagg 
caccatctgt 
gagtacctag 



acaagatcag 
agtttactca 
tgafcgaagat 
ccagtaatcc 
caggagtttg 
aaaatacaaa 
actcgggagg 
gcagtgagct 
aaaaaaaaaa 
ttgctcagag 
gtaaaatctc 
agtcccgggc 
tctacctgga 
gagatgtctg 
tgctcctcag 
cagaatgtac 
tggcttccta 
cacatgggga 
attttttgcc 
ctgtgtcagg 
atctttccaa 
tcatgcatag 
tccaagagct 
ttcacagact 
cctggcatgg 
cagcagaatc 
cgagacctca 
gccaagcatg 
gcaggaagat 
cacaccactg 
aacaaacaaa 
taaattgtga 
aagctagaat 
gcttttagct 
gtaaatctct 
atcagcacta 
tgggagtcag 
atacatgctg 
cttctaacat 
tatattgacc 
ttttcttaac 
agcttaatga 
cagcatgatg 
cctggagaat 
gctgacccaa 
tggccaattc 
gaggaaaatg 
tgggctaaca 
cttataaatg 
taaacctcca 
cctggatgat 
tatgtgcggg 



ttgtgtgatc 
cccataaaat 
gaaattaaca 
cagcactttg 
agaccagcct 
aattggccag 
ctgaggtagg 
gagatgcact 
aaaaattaat 
aaatgatggc 
ctttgctaga 
ggaggcaggg 
ttttgtgcct 
tctcccctca 
aatcacgtgc 
acagcaaaga 
acaaatataa 
ggtcaacaga 
tgataataat 
caccgggcta 
agggattagg 
agaggttgag 
tatccattta 
tttggtgatg 
cagctcactc 
acttgaggcc 
ctaaaaataa 
gtggcacact 
gtcttaagcc 
cactccagcc 
caaataaaca 
caaatggaaa 
ggaaaatgca 
tcctagcagc 
atagcagtgg 
cctgggaacc 
gcccaggcat 
aggagtacga 
tgatatcgca 
aaatccccat 
ttcctactta 
aattagttcc 
ggctggtttc 
atcccagcat 
atttgaatct 
ccttaattcc 
actattttat 
gaccgtatgc 
ttcccacgat 
agccatcaaa 
tattgttcag 
gttctttcac 



ttggcaattc 
ggggagaacc 
caaagacaag 
ggaggccgag 
ggccaacatg 
gtgtggtgat 
agaatcgctt 
ccagcctggg 
gcaaagacaa 
cacaaagtca 
ggaagagtta 
aaggctatcc 
ccacgttccc 
gtgtgatagt 
tctgggcagg 
ctggaaacga 
tgagaaggcg 
gagcaatgac 
tataataatc 
ggcacttcac 
aagcaggagt 
tgtgacgctc 
ctcagacttt 
cataaaattt 
ctgtaatctc 
aggagttcaa 
aacaaaaaat 



cctgtagtcc 
cagaaattcg 
tggatgacag 
aatactactt 
aatgcaagaa 
tgagctcttg 
ccttgcaaag 
ttcccaagac 
tgttagattg 
ctggtttaac 
accacttatg 
gagaaatttc 
tataaacacc 
gcatacactc 
agaggcacca 
atgctcatgc 
tgctgttaga 
taggtgcacc 
ctccctctgc 
gtattccatg 
ttagtatggt 
tatgaactgc 
cctatgtcca 
atgcatattc 
ttatatcatt 



actttgtttc 
acgtctacct 
gctgggcgca 
gcatgtggat 
gtaaaacccc 
gggcaccagt 
gaacccggga 
cgacagggtg 
aattagtgtg 
tattactccc 
aaagtccaag 
tggctctgag 
tttcatcact 
aatataatgg 
aaaaaggtgc 
gaaaactgtc 
cttcctctct 
tgtcctgagg 
acttgcattt 
tgcatgagtt 
tattttacag 
atgttagcct 
ttgattattt 
ttttagagta 
agcactttgg 
gaccagcctg 
aaaaataata 
caactacttg 
aggttgcagt 
aacaagatcc 
tgctcattca 
aaaaaaggaa 
ctggggtggg 
aagaaaacaa 
atggtctcct 
tccctctcca 
gagcccccta 
ttagaataaa 
tgcatgacac 
atcatgtact 
atgaaacctt 
agctatgtca 
. acacttctga 
agccaggact 
tttcattaac 
ctcagtttct 
gagttgctgt 
gcttagtatg 
tagagtgtgt 
tgtctcctgc 
caaacttact 
tgcctttaaa 
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cagctctgca 
gttcccccca 
ccaaagccag 
attcatcaag 
ttctagtctc 
atcagttttt 
aagcccctag 
ttgccctgga 
gctcatttct 
ttcccatgaa 
atggcagaga 
tgtggaatca 
cagggtcgaa 
tggagtccat 
caacacctcc. 
acctaatcgt 
ttcctaatca 
cagtggggct 
cagtctacca 
.tcattctgtt 
tgctgaatat 
gagttcatgt 
cacttaactg 
tccacaagag 
.agacagacag 
ccagaaaaca 
cccttccctt 
gccagccagc 
ggcacagcac 
gtaagaggca 
tgactagaga 
- tgggagagtg 
gcttcccagg 
cgctcccatt 
gtggagtccc 
ctccagcctg 
aattcagaga 
ccgtgtgtcc 
gactcctcac 
ccttccccca 
aaggttcctc 
atacacacac 
cctctcctct 
gcccttccac 
ccagcttaaa 
cctcatctcg 
tgtgtccatc 
tcattgtcac 
acaaaacatt 
ttatatgcac 
ttaatgcccc 
agtgatatgt 



ctgtagttat 
aaatgaactg 
gatttgattc 
tgtttcttga 
tgatttttta 
agaactagag 
tttatagctg 
cccaaagagc 
ctcccctggg 
aagtaccaca 
gcttagggtt 
tccacatagc 
gacacagaca 
agccttggct 
caagcccact 
tttcagagga 
tcatattggt 
gggaaggaca 
cacctattgt 
aatatatatt 
gcagcagtga 
tcaagtgaat 
tctaccctgt 
attctctgca 
acacccaatg 
ccccacatgg 
ttccaaattt 
tgtgcaaaat 
agagaaatta 
cttgctctcc 
ggaggatcca 
acagccagcc 
gtgagccaga 
ttcctgagtc 
gttgtcatgg 
acaatgtgag 
actcaatttg 
tgcccttccc 
cccctcccaa 
ggtctctcct 
tgcgccccag 
ccagcatagc 
gttgaagaca 
cctgtctcca 
tctctctcca 
cctcccaccg 
gctccatcca 
atcccttccc 
aattgagtgc 
taatttattt 
ctacttgaaa 
ccaaaactac 



tattacccat 
acttttcaaa 
caggccttct 
ggatcttgca 
gccccagctg 
caggacagat 
tgagagtcca 
taagcagaac 
ccccacagca 
ctacgaagtg 
aattgattag 
ccttgcaagg 
gcatgtgtct 
gcctgccagg 
ccaccttcct 
aagagcagga 
caaagcaagt 
gagagagaag 
gcaaagtgct 
aactgaacac 
atgaaataga 
aatggcccca 
caggtttcca 
accagcgagc 
acaacattga 
agtgggtccc 
cctccctagc 
gcctagcaag 
agtgcaatta 
tggcggctcc 
gagcctatgg 
agtcttctgc 
ggtcaggaat 
taggaagatg 
tggcaggagg 
gctgggcttt 
acccatgctg 
ccaagtgggg 
gacccagtca 
ctctccctgg 
ggctaagtcc 
acatttttct 
tccccagaag 
cctggcatat 
tgtttggcct 
gttttgcatg 
ttgagcctgt 
tggagacaga 
ttattttatg 
agcttcccaa 
gattaagcaa 
acagctaata 



tttctgcatc 
agtcacacag 
ggcttgacag 
gatattttct 
ggcaggaagg 
gcttagaatg 
ggctacaaga 
aaatactcac 
tcttcgaaag 
cagccaaggt 
agcaggaaaa 
aggtatcaca 
ctctagcctt 
aacccggcac 
caaagggtaa 
actttggagt 
cacacagcca 
ggagaatcgc 
ggaagggagg 
gagccaggca 
cagaagtact 
cacctcgatt 
tatgtgccag 
tcatggccca 
aggcttgcac 
actctaggcc 
cacatgcctt. 
ctctcagcag 
gcgtaactaa 
ctttcggaga 
ctgaaacctc 
ccgcaggctg 
cagaaggcct 
gagtaaattg 
agaacagacg 
tagcaaatgt 
tatcttcttt 
caggctagaa 
atctataggg 
attgtggcaa 
tggtcccatc 
agaactcaca 
ctgatccctt 
gcctgtgttt 
tctctgcctc 
gatccctgcc 
gaggggaaag 
gtaaaatagc 
tcaagtgtga 
aactccatta 
cagaggcatg 
agtgagagag 



aggaaactga 
gtattaagtg 
tggcactcaa 
catattgcaa 
aagcaggtgc 
tatctggttc 
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cacttctctg agcctcccca aaaattgaga ataacagttt ttattccata 169978747 
gggtagtttt gagaattggt aagatgatca taaatgtaga agggcctggt 169978797 
actcataact gttttcttca tttatttatt tagtcaacaa tgttttctga 169978847 
gcacctaccc tgggccaggc acactgaggt gggccttggg attcatcagg 169978897 
gatcaaaggg aaaaggcatc ccatctccgt ggagcttcac atcatcgtag 169978947 
ttaaggtgga taatgactgt ctcgacagat aggacagtgg ccaggacaca 169978997 
ggccttcagt ctggagcatg cttctcattt tgaaataatt tcttgttcag 169979047 
agtcaggttg gaaggaaggc cttgcagggg ctgggtaagg atggcggcat 169979097 
ctctgcgttc acgtctgtgt gctcaccatg agagcccacc tctctctggc 169979147 
ccaagtttcc ttctgtaatg tgaaagaggt ggggttccat gctctcctct 169979197 
tcagcatcct ccttagatac tcaatctggc actcatcagt ttcggaccct 169979247 
ccatgctgcc ccaaaggaag ccacactgca ccttcatcca gtttcttaaa 169979297 
cctgcatgtt ctctcttgcc tctagacctt cgcccaggct attccatctg 169979347 
cctggaacat gcttccatac acctgccttc acttggctaa ctcctacttc 169979397 
tgccttgtgc cactgcttgg attgaatgtc ctctggaatt cagccttcct 169979447 
ggatgtccct gagctgggct agggtcccct ctgcccacct ctgctctttg 169979497 
attttccctg ttcatcaccc cagttatgac actgagttgt aactgggatt 169979547 
tctaatctgt cgcctccacc aagcagcaca ctctttctgg ccagggtcca 169979597 
tgtcttattt gtcttggtat ctctagggcc cagaatctgg tacctagtga 169979647 
atgtgtaaca aatattttct ctgagctaaa ataggaacac agggtccctg 169979697 
caatgaagac cccaggtctg gaggggaaga caaggtagca ttacaagata 169979747 
acaggccttc cctgcaaagc gtcagcactc agactggggc attgtggaca 169979797 
agaagggtgg agaggagaga aggcatgaag gcatcccaga tagggacaca 169979847 
gcagaaacaa agacaaaaag gccggataag caagatctct ttgagccggc 169979897 
aggttaagat gggtagtgga aagctggacc actgtcaggt tttgacccct 169979947 
ccagacaatg caccgtgatc ttctggcttc catatgtccc tggctgctgc 169979997 
tgcctgctgc ccactgaggg acctcaccct ccagactcta agtgatctta 169980047 
agccaccagc cttacaaatt acagcagaca gagggggtcc aggcccttct 169980097 
agagatagtg ttcacagcac tttcccaaag ctggttctgc tcaattccct 169980147 
aaaagatctc ctgcaatggg gcctccaggg tcctaaaagc accatctgct 169980197 
cttaaagagt -ctcactgctc attagcatat caaaggctct gaaatgtcct ' 169980247 
gcaggaaaaa actctgggct aacttttgta gccctgttcc ccaaactaaa 169980297 
ttgaatgaag gattcttttt ttatgaccca cctacagttc tgcagagatc 169980347 
ctccagggaa tgcagatcta acccattctc tgtgcagtga atggcagaga 16998 03 97 
aaattaaagt gattgcatct attgcacact tacttttcct tcgaggtagg 169980447 
tgagagagta aataagatgc aggagagaat tacaaccgga gatataaaac 169980497 
agttgtctta cagggaatag . tctgtttgcc cagggaggag gttttcagaa 169980547 
tacaaagtac catttcttct ccacgacatc cccaggggag aaaggaaagt 169980597 
gaaagaagcc ctccaaccga ccactggaga tgggagcatt tctgctggtt 169980647 
tcatttacat aagtgtcctt ggcaatttgt cccaatttgt ctcattataa 16998 0697 
atgatgtaaa catccgggcg taatcacaga caaaatggga tggtttaaat 16998 0747 
gctgcctgat taaaaaccct gggttggctg caatgacttg caggcatgct 169980797 
aacggcttcc ctgcaaggga ctttggtttc aaagttctga gagctcagga 169980847 
gggtccctag gtctagttgc aaagtcatgt cccccatgac tctgcaactc 169980897 
tccacataga gcctagcaag aatgcatcat cttctctata tctcttataa 169980947 
ggagggaagg aagaaaggga gggaaggagg gagggggaga gggagggaga 169980997 
aaggaaagaa gaaaagaaag gaaagaagga aggaaggaag ggagggaggg 169981047 
aggaaaggag gaaggaaggg agggagggag ggaggaaagg agggaggagg 169981097 
gagggaggga gggagggagg gagctaactg acattttccc ctttgccatc 169981147 
tgattggcat gaagggcaat gtaactgagg agtctgggga ccaaagtgaa 169981197 
tcctagtaat gtcactcact taatatgtgt gattttttag accaatcaca 169981247 
tcatttggcc aagcctcagt ttcctagtct gtcaaatggg aatgatgatt 169981297. 
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aaagagccag 
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cagaaaagtc 
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ctttaatgta 
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tccctgtggg 
gaagtctatg 
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gtctccttag 
tgccatctat 
tatgtctcct 
ccgtaaaaaa 
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. tattgtctca 
atgctggaga 
cgtattgcaa 
gggtaattgt 
gaaagagatt 
ctcggcaaat 
,tggctggctg 
gtgcagtgtg 
cttctaatgg 
tgcctgtaat 
cccaggagtt 
taaaaataca 
gctactcggg 
gttgcagtga 
tgagacccta 
aaaagaaaaa 



ccaaaggagg 
gctcctgtgc 
aattttaaga 
aatattcaaa 
atacaaaaaa 
atatagccag 
caccccccaa 



ccaccctgcc 
agagagtagt 
tgcattcttg 
cgttgtgtaa 
gtatttcttt 
atagacaatt 
acatacaaat 
taggggtgga 
gaaattgcga 
ccagctctcc 
ccaaaagaca 
aatatttgca 
gaataacaac 
cattctaatt 
tttaggccca 
tgaagggcag 
aaatgtgggt 
aacagtcagg 
tgggtacccc 
accatattaa 
tcaggggaag 
tttttccttc 
gcattaaccc 
actgtctcca 
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aatcttctac 
tcctggacat 
acaaatgaat 
gtggattaat 
tatgcattta 
cataatttcc 
tactatttta 
gttcatgttt 
gtacagtaat 
aaataggcgt 
tgggggaggg 
agtgcctggt 
ccaagaatag 
cccagcactg 
caagactagc 
aaaagttagc 
aggctgaggt 
gcccagattg 
ttaaaaaaaa 
agaaaagaaa 



ggtgccctgg 
agttgggaga 
aatattgcta 
ttccttttta . 
attgtacata 
acctatgaaa 
agtttcttca 
atcccccacc 
ttgcattttc 
tctggcttct 
ctgaatactt 
gtatggatat 
ccaacttttg 
ctttacgtgg 
attgctggat 
atttaggatt 
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ctcttaacaa 
aagcatctga 
tctattttcc 
aggagcttct 
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agggcgtgga 
ctcatggcga 
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atccatttct 
gcaggtgggt 
tccagctaca 
agcaggaatg 
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tacttattct 
gcctcggaca 
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tatggtacct 
agtgactgac 
tttctgactg 
ttaaatgtga 
tcaaggggat 
ttgtggctgt 
ggagacagag 
tgccaggtta 
aagacctgaa 
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ctgggcaaca 
caagtgtgat 
gggaggatag 
aaccactgca 
aaaaaaaaaa 
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agaaaaatga 
aggaaagggg 
ggaaggctgc 
ggctctaggg 
cctatagtat 
ttttgtcatt 
tacagatggg 
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tcctggagct 
tcactcttca 
gggaaggagc 
gtaggggacc 
cccagctatg 
tgtcagagtt 
gaattgggga 
ttgctgccca 
ggtatcacga 
atcatcatca 
gttgaaactt 
agcctcaggc 
cagaaaacag 
catactaatg 
gcggcacagc 
tctgcttgtt 
atggcagaat 
ctcccaggag 
ccctgggcca 
tgaaattttc 
aagggtgcta 
atcttctgca 
gtgcccaagt 
ccattcacca 
accttctctg 
tttctgagct 
gttttccaca 
tgcttcaggg 
ccgaaggagc 
aaattgtatg 
ctgggtatcc 
agtaggaaat 
aatgagggta 
ttggatttgt 
tgcatatcaa 
gttagccatt 
gcttgcagtc 
tctggttaat 
taagataatg 
ctgtatttat 
ttaaattcag 
ccacttataa 
aaataatgac 
tatactcaaa 



cctgaagtca 
gcattgccca 
aggttttcat 
tccctgcagg 
gtcaggcata 
aaaacatccc 
gaaacccaag 
ttgcccctgc 
gacatttagg 
tttttcttgt 
gagggtctag 
agtctgctgg 
aaccccattc 
gctggatttg 
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gtggtgatca 
acatatcttc 
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aaataaggat 
ttaagtggct 
attaccttca 



ctggatttcc 
cttagggtaa 
gtgtgcttgc 
gacagagagg 
tacttgtgcc 
tgtagaaaga. 
ctctgagtgg 
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ttgattctga 
ttggaatgag 
gcctgcatcc 
acatgcagac 
ctttcagtat 
ggattttgca 
ctggctggac 
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tcctttgctt 
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catccttttc 
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cctggtccct 
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tgctgagcac 
tgctccaagc 
gtcatgtcca 
gtcacaataa 

gggggg catc 

aatcatttac 
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ttcccatgca 
gtagctcaaa 
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gatattacct 
tccacctgtg 
ttaactaatt 
catgacttgc 
ttatggtttt 
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tactttggag 
gaatctagtt 
gctggataaa 
cacaaaagat 
agtcattctt 
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accgaattgt 
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tacttacatg 
tcagctctgt 
tgtcccattt 
cctgggtgca 
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gcccagattt 
gctgggtcgg 
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gaaaagtcag 
ggatggcatc 
tgtccatgag 
gcaagcattg 
agcagaaagt 
actcttcatc 
tacctctggg 
tgcccaatcc 
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agggcctgga 
ctctctcatc 
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gaagcctggc 
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taaatatgcc 
ttgttggaag 
agagtgcatt 
atgtatgcta 
aggaggcacg 
acttctttag 
ggttgaattc 
cccgccaccg 
ggtcctagat 
tgacttcttc 
gtgtgtcaga 
gtttccactc 
cttggggaaa 
cccgctgtgg 
tcagtcatct 
gaaaaatcta 
gggccccaaa 
caaaacaggc 
gacctcttgc 
aggggcatgg 
aaccctgttc 
ggcccagccc 
gtgcacacac 
agtccttgtg 
tgtttataga 
tgccccagaa 
cagaacccag 
agcctcagct 
ataaagaccc 
aatctagagg 
ccaggggcag 
gcacagtcac 
ccagtagggt 
ctgcttccca 
agcaaggccc 
cgtccccatg 
acatgactca 
tttttttttt 
agtggcatga 
ttctcctgcc 
atgcctggct 
ggccacgctg 
ctcccaaagt 
agtaagtctt 
ctgggaaatg 
gccctgccac 
gcctcagttt 
gccctccagc 
ttaacccttt 
acacttcccc 
caactgttga 
gctagtgcaa 



aactcaaaat 
aaatttctct 
aatattcggg 
atatgcaggt 
tggaagccaa 
ctggcaaagc 
gcatgcacca 
ttggccaata 
gtgttgctag 
cccagtttca 
gtagaggaga 
cacttctgac 
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aacagcgcac 
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agcctcactc 
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cctgacctca 
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actggtgcaa 
ctcacagtca 
ccagacctgc 
aggctatggc 
cacagatgtg 
gtatctaagc 
atggaaaggc 
aaaaacaatg 
gctacatggt 
gcaatttata 
agtcccaggt 
gtactccctg 
gctgagtgca 
ccaaataatc 
gtcctacctc 
tttgagggga 
gagatggcag 
atctcatagt 
ggaagggtct 
ccaggccaca 
ggaagaagaa 
acggtcactg 
ttccacgatg 
ttgggagcct 
ccatctaagt 
atttaggttg 
gggactcttt 
acttcttgcc 
ctgatcactt 
ctttggaaag 
tgctctcttt 
ggatccccac 
gggactaatg 
cttcaagaat 
aaatttgggg 
attcctactg 
ttcttccttt 
gacttcccag 
tgaggagtta 
ccaggctcgg 
tctcagaacc 
gcaaaagtaa 
gtcctcagat 
ggccccttgg 
tattgcttat 
ttccatggcc 
aagctcatct 
acttgaagtt 
tcaaaatatt 
ggcacagccc 
atgtttctca 
tataataaaa 
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cagtgtaaag 
aaatctctct 
tgtccttact 
actactactc 
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ttcattttct 
atggatggat 
gtgagtgtgt 
taactatctc 
ttaaagtctc 
acactaatga 
aaaacaacat 
tgctctgttc 
tctttgtggt 
cactgacttt 
ttaaaccagg 
accagagaac 
catcccttac 
accagcgcag 
ccgtcctcta 
gggtggatga 
atgaatggat 
tggtctgatt 
aacccttcca 
cagttctctg 
tggcctgagc 
gaccctgcac 
ggagctcctg 
gcctgtaatc 
tcaggagttc 
aaaatacaaa 



gcaagccaca 
agggcagata 
tgggcaagaa 
cactggtccc 
tcctgggctg 
gggccacagc 
atgtcagccc 
cttgcgttta 
cccagtacag 
caagagttgg 
gtgcctaatc 
gatgccaggc 
aggtggttcc 
accacacttg 
agaatcacta 
aaaattgttc 
aggctgggct 
ctttctcatc 
gatggtgtga 
atacatagtg 
gaatacattt 
caggaagaca 
ggacctcagt 
tcatgagact 
actgtcctta 
tagaatgtat 
ttccccagaa 
ggttgatgga 
ggataccgct 
ggacttcgga 
agcacagtgt 
gatgagtgga 
ctgagcttgg 
aattaaaagt 
aatgaagcat 
tttattccct 
agagaagaat 
tgaaagaaca 
ctatagctga 
aacacatact 
atattttctt 
atagatggat 
tgttggatga 
cttcctaagg 
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acctcactac 
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ggtgtggtga 
caggagaatt 
acaccactgc 
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gaacctgaca 
ccacctctct 
tctgaggcag 
cccattctgt 
gcgcaaaatg 
aatccttaac 
tcccaacctg 
ccctagcact 
cccccacagc 
gttcttgccc 
tagttgaagc 
tgcaaactcc 
agtattcttc 
tattatttga 
cattgatttt 
ataatagatg 
taaatgcctc 
ctatttttta 
gctgggagct 
agcagttcac 
agtgtgtcga 
agaagaaatc 
ctaagctgct 
ggcagaattt 
agcctggcac 
atgaggacat 
agatcacaga 
gcatggaagg 
cgaaggggtt 
ccctggagaa 
cagaatccaa 
tcattgtggg 
gattaagcag 
ataatagcat 
cggcttctta 
tgcctcattg 
gcagaaaata 
aaggcaagca 
tgcctaggca 
gatagcaata 
ttgttttcag 
ttacagatca 
atcttgacaa 
ccatgaaaat 
attcatatgt 
tggggctagg 
tgagacaaac 
cacttactag 



tgcagcacac 
gcttgagcct 
actccagcct 
aaataaaaaa 
catggtccag 
gtggtccttt 
tctggaagtt 
tctctgcatt 
ttgttccttt 
acggtctaca 
atctgtcaca 
aacctttggt 
cctcagaacc 
cacctaccac 
atcacttcct 
agcctgggtt 
ctttatggca 
gttatgtctg 
taatcattgt 
cttaataaat 
attcaagagc 
gctcttaatt 
tagacagtaa 
aatgtaacaa 
gtgctacaat 
cttctattcc 
tctggaagga 
caggtgaaag 
tggaaacaaa 
ggggctatga 
ggcctttcaa 
gactggcatg 
cagaaaagag 
actgaggctc 
acgcttccaa 
gagaattaat 
tgtaatcagc 
tgagagctgt 
gagcatattt 
ccagcctctg 
gaggagaaat 
ttgttcattc 
ataggtttta 
atatggtaaa 
aatctataga 
tcctggacta 
catttgtttt 
gtgttattta 
gctgtccccg 
agtcctcccc 
tttggagtca 
ctgtgacctt 



ctgcagtccc 
gggaggtgga 
gggcgacaga 
aaaagaagag 
gtgtctccct 
aggcattaac 
cataccccac 
gcagatggtt 
tcttaaaacc 
atattctgca 
caccacctcc 
tcattcctag 
tttacacttg 
ctagttaatg 
caaggagggc 
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ccaatcacgg 
tctcccccag 
atccctagtg 
atttgttgaa 
tttggctctt 
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agcttagatg 
aaaagagaca 
agatggaggg 
tcactggagg 
aaaggtattg 
gaccagcata 
atgcaagtca 
gagatgaggc 
tgacagcaac 
gtgagacctg 
atgtccatat 
tgccccaggg 
aggatgttca 
ttcagggtgg 
tgctgctgag 
ttttaataac 
aaaatatgtc 
aagacaaatg 
tggaggattt 
tgacaccact 
gttggtttgt 
ggagatattt 
atcaccattt 
tctattatag 
acttagttga 
aaagtgttcg 
cccaaccccg 
tcttgggtaa 
gacagacctg 
ttgcagctta 
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gacatctgga 
gaggacttct 
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aatctctgtg 
taaaacacac 
cattgtggcc 
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tttctttgtg 
gtaaactaca 
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gtgcagtttt 
cttcactaag 
gccaagaaaa 
tgcaaaggct 
cttcgtggac 
tggggaggga 
atatggcagc 
tgatatggag 
ttcaagtgaa 
ctgtccatca 
gtgtggctga 
agaattttaa 
agttgcaaat 
tcaccagtgt 
atagtgtgtc 
tcttcccagg 
gattgcaaac 
aattcaaaat 
cttcaagggt 
ataatttgta 
tcatgcagtc 
caggtcccca 
gttactaaat 
atgaattctg 
attagtctaa 
cagacttgac 
agttcaaatt 
aacactgagg 



gagactgagg 
agctgagatc 
gtctcaaaaa 
gcaggcctgt 
ggaagtctcc 
cactgtgtct 
taccttgtcc 
acacatacac 
aggctagaca 
cctgggtgcc 
gttccctggg 
agcacttagg 
tgctttaaat 
cttcagctct 
aaactcatta 
catggccctg 
atacttatgc 
gtattagaat 
agcctggctc 
agtgaatgaa 
tacattactt 
ctgggcttat 
gttgctctca 
gtgacaatac 
atggatgtgg 
gagatattcc 
taagggtgga 
cagagggcag 
tgagacctgg 
gacaggggcc 
tgcagagaca 
aactccctct 
ggacttcctc 
gactccttcc 
ccttgcttat 
aaaaatataa 
gcaataaata 
aatttagcct 
taatggcaaa 
agggaggggg 
agggtgtagg 
ctgtgatctt 
gaaacagatg 
agagaagagg 
catattctaa 
ggcaactgat 
attcatgata 
agttttagta 
actggtgaga 
acagttaaag 
ctagttctac 
ctctattccc 
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gatctgttaa 
atggatcgcc 
ctaccatggt 
caagtaactt 
agtatctgtt 
tactatcttg 
agttcctggt 
ctgaatgtat 
gtcactctca 
agttactcca 
gtgggcggag 
gtaaattagt 
ggtgcataga 
agcaggttgt 
attgtgatgg 
ggcaatttat 
ggaaggaagg 
gagggaggga 
tgattataaa 
cttctagaca 
atcattactc 
acattttgtt 
tggtgcagtc 
tcgactcccg 
ctgtcagctt 
atgaagaatg 
tttatcccaa 
aaattgatct 
atgccagggg 
tctccagtct 
gaccctggga 
aagcacaaca 
tagatgcaca 
tctgtctaaa 
caacactttg 
gaccagcctg 
ttagctgggc 
aggtgggggg 
attacaccac 
aaaatatata 
agttcatgcc 
cgaggtcagg 
ctcctaaaaa 
gtcccagcta 
gcggaggttg 
cagagcaagc 
accactaaaa 
ctgtgcagag 
ttagtttccc 
tgctgtatcc 
tcaacaaata 
tgaacagaac 



gtgagagtgg 
tattgtgtta 
tcctgtttca 
tcctgaggca 
ctagctggtt 
tacagttggg 
atgtaattag 
cttcctgggc 
acgctttatt 
caccctccaa 
gtggggagtg 
aaaacaactt 
tcctgtgaca 
ctccatctct 
gggctgttct 
tcccaagagg 
aaggaaggaa 
aggagggaaa 
tgtgtgtatt 
aagaaggtaa 
actggcttta 
caaagcaaca 
attaatgaat 
ttctccagcc 
ggaaccctag 
ggccctgcag 
aggacatgga 
cccaggatgc 
aaaaccagga 
ccatggagct 
tggagagcca 
cctcttccca 
gagggttttg 
attgctctgg 
agagctgagg 
ggcaatatag 
ttgctgcctg 
atcactcgag 
tgaactccag 
tatataaata 
tgtaatcctg 
agatcaagac 
atacaaaaat 
ctcaggaggc 
cagtgagccg 
ccccatctct 
acatcattag 
cccagcacag 
tctctttgcc 
taactcaaaa 
tttattgaga 
tttgcaaatt 



tgatgatgac 
tctccaacac 
cagatgagga 
cagggctagt 
tcaaagccta 
ggaagatttg 
gactttataa 
tgtgtcctcg 
aatgtccctg 
agcctcaagt 
agcgctctta 
tgggaaacaa 
cagcagtttt 
gcactactga 
gggcattgta 
aaagcaatga 
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tatttaatag 
gaattggaat 
aaattgaaac 
tttgcaaatg 
aatgattggc 
gtgctcatta 
tgttttaagc 
taggggaggg 
gctgacattt 
atcctggaag 
ttggaatgat 
agcttctctc 
gactttgaga 
gccctgctgg 
atgacaggtt 
ttttgttttt 
gctgggtaca 
caggaggatc 
tgagatctct 
cacctgtaat 
cccagggggc 
cctgggcaac 
aataaataaa 
gcactttggg 
caccctggcc 
attagccggg 
tgaggcaggg 
agattgtgcc 
aaaataaaat 
tagttgtaat 
tgcctgctcc 
cccattttct 
gttgattatg 
aactgcaaca 
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agacagacct 
ggatgctgat 
tgggagagta 
cagggttcaa 
tgttccgagg 
cagttgtctt 
attaactcct 
attccacaag 
ctgaaagaag 
ttttttttaa 
gtggctccca 
gcttgagccc 
atgtctagaa 
tccagctact 
tgaggctgca 
agagtgagac 
aataaatggc 
aggccaaggc 
aacacggtga 
catggtggca 
gaatcacttg 
actgcactcc 
aaattgctct 
aattaagata 
atggaaaatc 
tatctataaa 
gagagtggca 
cgcaaataga 
aaggacattt 
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cagatatttc 

cagcatacct 

aaggaaggaa 
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tccacaggtt 

attctgatcc 

atatctaaat 
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ccgggaggca 

acatctgcac 

gtgatggaga 

ggttctgaaa 

gatcctccca 

gttttgtaaa 

gcctccttcc 
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gtgctgctta 

gccttgtagc 

cctttcagca 

cctgtaatcc 

aggcgttcta 

tgttttttaa 

tgggaggcta 

gtgaaccatg 

cctgtctcaa 

caggtgcagt 

aggcggatca 

aaccctgtct 

cgcacctgta 
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gaacttccta 
tgcaagagct 
aaggagggag 
ggaggaagaa 
gggaggtagg 
ggaaaggaaa 
agatatcatt 
aaacccatcc 
gaacacacga 
ttttctctct 
agaatcagga 
gcacctttta 
aataccgact 
gtttttccaa 
aagcccccat 
cacactgcac 
gggtcattgc 
aagcaagctg 
gtgactgcta 
tactcagctg 
ggagttcatg 
aagtaacact 
tcaatatatg 
ttatggttaa 
cataattact. 
gaatgtctct 
aactcatcat 
taggcttctg 
cttactgggt 
ttgttattgt 
aggaaactgg 
tagtaagtgg 
aaaacttggc 
ggactctgca 
aactggaatc 
tctgctccat 
acacatcatc 
tttctttaag 
tgataaggaa 
ataaaatagt 
gcaggctcac 
aatagtaaag 
gcaaacattt 
gctggttaca 
actgggatgt 
ttcacaaatg 
agactttgga 
atgagacctg 
caccatttct 
actcacctga 
agaaggggag 
tacacagtaa 



ataggtaaga 
tgggcggact 
gaagatgaaa 
gggagggagg 
gagggaaaga 
ggaaaggaaa 
attattgtgc 
tgtgcccaca 
atcattttcc 
gaaccataac 
gctctgagat 
aagaaaagaa 
gcagcagaac 
atgaggttta 
ggcagcagca 
atgtgttttc 
accacggggc 
agagcagcag 
gaggccaata 
aaaacatttg 
cacccagcaa 
ccttcctctc 
acttgtccat 
taataacagt 
aattcatctt 
gttttacagg 
agtcacagag 
acccagagat 
atcagggact 
aacaacccta 
ctccttttgg 
cagagataaa 
atttgtgatt 
gaccatggcc 
cagtctttta 
gaggccagct 
ctcacctaga 
gcaaatggac 
ttgaagagga 
aatttgggac 
atctccaata 
agcaatttgt 
ctcatcaaat 
tgtagagaga 
gagaagaagg 
ccaaatcaca 
ggcaggtaga 
gatttccttc 
aggaaatagt 
gacacagttg 
ctgtctgtga 
cagcaaagca 



ctttttaatc 
gtgaatatct 
gaaggaaggg 
gagcgaggga 
aggaaggaaa 
ggattacagg 
aattgttatc 
aaagggcacc 
cattttcgtc 
catcttgcag 
ggtgctctga 
aacttcaagt 
attctggaaa 
gacagtacct 
agccctgttg 
agataactaa 
ctctccattt 
aagtagctag 
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caggttttct 
gactcagctt 
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tgacgttatt 
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gacagtggga 
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cacaaataat 
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gaagattctt 
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caccttacac 
ccatgagtgc 
acttcaataa 
aaccttgtag 
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gggacatatg 
tcacgcagac 
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cttggaagaa 
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atttgatttt 
aaacaaatca 
ctggagaatt 
cccctacctg 
ctatacgtac 
tgcctctagc 
tctttctatg 
tacatactgt 
gaagttaagt 
agagtcaatg 
cacactccca 
tcatctcatt 
ttcccaggta 
gtcacacatc 
ttgtttggca 
gacgcctggg 
caaggaccta 
aagtcaggta 
tccccctgac 
caaacctttg 
ctgtcagtat 
ggccaaaaaa 
cagaatctat 
tctatatgga 
cccaattagg 
agatttcaaa 
ctctgatgga 
tcctgtatct 
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tgtaagtctt 
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catcattctt 
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gggacctcac 

aactttggga 

agcttatgca 

gcaacagcat 

gggagagggg 
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ttctaattta 

aagcccagtg 

acacagtaag 

tctgggcaga 

gacagtgggg 

aacaccctac 

tgctgatttc 

tgaccccagg 

aaagaacaga 

tctcaatcca 

gaggagagac 

gcacatacat 

cctcagttct 

tttggaaata 

gtgtgtgtgt 

gtggtactaa 

ttagtatctt 

ggctcacatg 

tttctcaccg 

gaatgatgca 

gtatccttac 

atgaatttac 

tgttgtcaat 

gcctgctctg 
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ggacaggtct 

aagcaccttt 
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tagaaagaag 
gggaagagtg 
tacatgatat 
ctttgcttcc 
ttaggaagtc 
cacttgctca 
aaagatgcat 
cagaaagaga 
gtgttcagcc 
aacaacttta 
aaactgaggc 
aagtagctga 
agaaaccagg 
aaggcagcat 
agagaagagg 
aaaggtgaat 
ttgtggccct 
caaagacgaa 
tggattcgct 
tcaaatcagg 
ttaaaaccta 
tatcctgtag 
gtggttatgt 
ccagcacata 
tgatgaatga 
cttggaatgc 
aaaactccac 
gggtttctgt 
cccttccaaa 
agagctggcc 
gaaatctcct 
ctgacaaata 
aaactaatta 
tccaactctg 
ccaggtggca 
cacatcagtg 
acatcagttg 
tgtgggattt 
ccaagcatct 
aacttccttt 
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agcctttatg 
agacttactt 
gcatttctct 
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tctctgttcc 
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" cagaatgttt 
tcattctggt 
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tctataaagt 
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gttgaaataa 
atactccaca 
ggaaatcaca 
tgggcacaac 
tgcccaggga 
acttccctcc 
aaacaccact 
taagacctgt 
atattcccca 
agttttcctg 
aattcaaaaa 
ctaagtaaag 
taacatgcgt 
agcagcagca 
caacaggaaa 
tctgtaattt 
gagttttgaa 
aagcagctcc 
aaggaaggca 
tgcaacatta 
cagttcatgc 
tgagacttag 
aggattgttg 
atgtctttac 
tctgaaaatc 
caaatgcatg 
aaaggagtaa 
ataaacttaa 
aatagcatag 
atcactggtt 
gagtagaaat 
acattcaggg 
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tcagtgttct 
tccccccgac 
cacgtagaat 
cacacaggaa 
atcctctgtg 
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gaacgtacag 
cattcaaact 
taccgaatca 
tttgaattga 
tcaattatag 
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ccttcatggg 
agcagcccca 
agcctaggta 
cccaagaatg 
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gtgcacaggt 
aggacccagc 
cacggaagac 
cttcaatcaa 
aaagatccca 
gcagacactt 

ggagggggtc 
agcaggctgg 
ggacagcacc 
aggagagcct 
ctggcccctc 
tgaagtgctc 
ggaaaccttg 



tcagtgctaa 
cttgcctagg 
tgtggcccac 
atggtccaag 
gttcctcctc 
aggtcctgga 
attcaatata 
taattattgt 
ttaaactgct 
acagatgcta 
caattttccc 
ttcattggcc 
ctacagccag 
ggtaccacca 
acccctggaa 
gccttttcag 
agaagcaaag 
ctctgatata 
atgagattgc 
caagatctag 
tgggaatcac 
ggatactaat 
caaatgtgaa 
tcttctatat 
atgcttatat 
atactaaaat 
gtttgccttc 
ggaaattaat 
agtttctgga 
agaactgccc 
ttttattgca 
aattggggat 
ctctcacccc 
tggggacggg 
agatggagcc 
gtctaccagt 
taagctgcca 
tctaatcctc 
aacaccagaa 
gacaggtatg 
tcttcccacg 
cctccatcaa 
actataaatg 
taaattcccc 
tttatctcgg 
tgaggaaggg 
aggcagttct 
ttagcaggaa 
caaactcttc 
cgggatgaca 
cagccaaaga 
cctcaccact 



gcctatataa 
atctcctcct 
aggcagctgg 
aggcttctgg 
agctcccaac 

gggggcgccg 

gaaaacacat 
tccctatggg 
caactaatag 
atagagacat 
caagaaccac 
agatgcaatt 
gttgtcatca 
cggccaaatg 
gatcccctcc 
gcaaagcggg 
aatcaatgct 
taaagaggat 
aacaatacat 
cttcagatgc 
ttaagctgtc 
atcacctccc 
cacacttagt 
atcttctctt 
gttggttaaa 
tagaagacta 
cccaaaactc 
ataaatttaa 
gaatagatcc 
aaggttacac 
gagggaattg 
gtggtctggg 
ctcccgctgg 
ccagccccca 
aggctgacac 
cctccctcct 
tctgctgtgc 
gtgaagaccc 
ggcaggtgaa 
agagccaagc 
tctctgcctt 
catgaactgc 
ctgaaatttt 
tgtcatatca 
gcttggagga 
ctggcagcct 
gtgctgccgg 
tgtggtggag 
aggcagatct 
ggtgccattg 
ccccaattta 
gcccactttt 



170054147 

170054197 

170054247 

170054297 

170054347 

170054397 

170054447 

170054497 

170054547 

170054597 

170054647 

170054697 

170054747 

170054797 

170054847 

170054897 

170054947 

170054997 

170055047 

170055097 

170055147 

170055197 

170055247 

170055297 

170055347 

170055397 

170055447 

170055497 

170055547 

170055597 

170055647 

170055697 

170055747 

170055797 

170055847 

170055897 

170055947 

170055997 

170056047 

170056097 

170056147 

170056197 

170056247 

170056297 

170056347 

170056397 

170056447 

170056497 

170056547 

170056597 

170056647 

170056697 



FIG. 1.131 



20040411 93A2_I_> 



WO 2004/041193 



PCT/US2003/034681 



132/154 



tcagaagtaa 

aattaataat 

atcccatctt 

ttttccaatg 

ttatgggcaa 
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tcttttcttt 

cacacatata 

taaataggtg 
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atatcgcttc 
atacatatgt 
ttatctgtga 
tttgttgccc 
tccacccctt 
gggactacag 
agagacggga 
caagtgatcc 
gccactgtgc 
ctgtgcattt 
tctgtgtcaa 
aacatatttt 
ggttctctgg 
ttcaccatta 
gctgtggatc 
atatgcctga 
gttacctatc 
cagataaatc 
agctctccaa 
atccagttgt 
cctgggatct 
tcattcttca 
acatcccttc 
ctcagctccc 
aatgaaaaat 
tatatatggt 
ctaagaacca 
ctaaagatct 
tgagtaacaa 
ttcctaccac 
tacctgccaa 
tatgcctcct 
tcatctctct 
atgcttctaa 
tgaaactatt 
ttgtcaaaac 
actatgggct 
aacagatata 
gggcggaata 
cctgaaacag 
aaatgccacc 
aagtgtctca 
ggatgcccac 
tgctgtctgg 
aagatggtgg 
gacaggcgga 
caaaatagac 
ggaccttgga 
tggtattttg 
catccactct 
cttttctttt 
gaatcaaaaa 



attatcatta 
tcaaactttt 
ttccattctt 
agcctggagt 
gggctcaagt 
gcgcgcacca 
tttccccatg 
cccgcctagg 
ctggactgtg 
ttctattgac 
acccactttt 
tcaattctac 
tggaattatc 
ttttaaagcc 
tattactgta 
cagattttta 
tggttgagtc 
caaacaggga 
agccttcaaa 
tctcaaggga 
tcagcttcta 
acactcaact 
tagtcccatg 
agttct.taaa 
tctagaccag 
acatgctgta 
ggaaacaaac 
ggactcactg 
ctgtttccct 
tctctatagt 
tgactactgg 
caccagtgtg 
cctaatacct 
acatatctat 
ctgcgtaatg 
ccacagaact 
ttagttaata 
ccacaccagt 
tacacaaact 
ctctaaaaat 
tcaccagaag 
cgtggaatac 
ctatacgatc 
gttgcagatt 
ctgagctgcc 
agttcaaata 
ccacaaaggc 
ctcacccagc 
tcttgtattt 
gccacacatc 
tctgcatctg 
cttgtagact 



tttatctctt 
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tttttttctt 
gcagtagcac 
gattctcctg 
ccatgcccag 
ttggccaggt 
cctcccaaag 
atttcattct 
ttatctacca 
aaattggtct 
aatttttgat 
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cagatatgat 
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ttcatggagc 
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ctgtaacagt 
gtacaacaca 
tctacatatc 
gcaagatgtg 
ctctatattt 
aaagtatatt 
gatcaaaggg 
aggtcaggaa 
actttgaaga 
gggggacagg 
ttacacactc 
agagcctaca 
cagcaggaga 
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ctaatgggct 
gttactaatc 
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catatgtctc 
tttttgagat 
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gttttattca 
gtggagtgag 
aagaagcaga 
taagcctctc 
attgcacatc 
catggctgag 
gaggcaaccc 
gctcagctcc 
acgacgtgga 
gttttgcttt 
tagACATCGC 
aaactaacct 
caagctcatg 
tttgttaaca 
agacaagtta 
ccttttacag 
agtaattgct 
atggtcttca 
attactttat 
gtcctttgtg 
gctatgtggt 
aaactcaatg 
ccaggggtga 
cagccggtgt 
aaaattgtca 
gtacatggat 
aagatacatt 
ccgaggatca 
gtggctcatg 
cacctgaggt 
gtttctacta 
gtaatcccag 
gaggtggagg 
gcaacaagag 
agtataaagt 
ccagggagga 
cctcactggc 
caggtctaac 
caaggctggc 
ccactgtggg 
gctggattct 
cagacaaagc 
atagcttagc 
agtacagtca 
acaattgtgg 
cagtgacttc 
tggttgtgtc 
aatgaaaata 
ataaagaaaa 
cgttatcaca 
aaataaagtt 
attaatttag 



ctgacttgta 
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gagggggaat 

ctcagtcaaa 
tctgccatgt 
ggcatgttca 
actgctcttg 
agctctgcta 
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gtttagtaat 
cacccagagc 
aatagttgta 
tctaaattta 
tgttacaatt 
tagcctagga 
taccccatct 
aaaactgcct 
cacatgactg 
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cattaaggat 
ctgtgtagca 
gataggtatt 
taatatcaca 
gttatggaac 
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ttacatctca 
tagtaactag 
tctggcatca 
cccacgggtt 
tctccttcct 
tggcaacagg 
tgtgaatgtg 
agggagagct 
acagctgtag 
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tgggttgtga 
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acttcccagc 
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aaagcttggc 
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aattacttag 
ttgagtcgga 
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acccttcatg 
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tgcattttaa 
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caaacaactc 
ttaatagttg 
tgggaaacaa 
caaagaaatt 
aattaagtgt 
aacagtgtac 
atattcccat 
ctttcttcac 
aaaaggaggg 
atatatgcac 
gatctaatgc 
ggttaaaata 
acttatgtga 
tttcatttac 
gttagctaca 
ctggggaaat 
aacaaatcat 
agcacatgag 
tcacacgaac 



ctcactgact 
taagtgccct 
ttgtaccttc 
cttaccattg 
tacaggttta 
gtagcagggc 
acacgatgac 
tgttaagcga 
tttttcctga 
gatcccaaaa 
tgagaatttc 
tgcccctagt 
aatcacttct 
cacacgtaag 
agtactttac 
tgccatagta 
aagtctgact 
actagcaaca 
aaggaggctc 
acagggccat 
aatccccagg 
taaagtttcc 
aagaaaggga 
tgaggttgcc 
gtcatgaccc 
taaaactaag 
catggagttt 
gtgtgtatac 
gcaaaagaaa 
caggcaggct 
ccacaaccga 
aaacaaaaca 
aatctgtttg 
atttattaat 
atatctcaat 
gtgctgtttc 
tcccgagctt 
actgcaaaaa 
tggatgtctt 
tttctttatt 
tcaagccacc 
atgtgaaaac 
agataaacag 
taggagattc 
gatgtaaaat 
atttttctcc 
gtagaaatgc 
aacacaacca 
tggccacttg 
ctgccttagt 
aaagatggca 
atccatcacc 
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tttggggata 
ttgggatcaa 
tgcaatgaga 
gctcacctga 
atgagacatg 
tcttgccagc 
tgcatgacaa, 
ggtgtaagtt 
cctacccctc 
tgtgatcctt 
actttctaaa 
agacatactt 
tggtttgtac 
ctatctagta 
. tatcagtaat 
tatccattcc 
aattttgttt 
cctgtgtctc 
gcaaagaagt 
ttgagcactt 
cgaagccccc 
aagtggggtt 
gtcctagatc 
cctaacaaga 
gttccAGATA 
GCCCGAGGGA 
AGCTGCAGGT 
ctgaaggcct 
aagggagctc 
agctctataa 
ctccaggcga 
cagaaatttc 
gtcaagcctt 
ccacaataat 
agactgatgt 
gaactgacag 
ccttggaagc 
acgggagttc 
cagtctcagg 
tattttttaa 
gtggtgtgga 
tccaggtgca 
cacagcctca 
cctgggagtt 
cttgactgca 
cttcgatggc 
gggaaagttt 
aagagaagaa 
taacataggt 
tacaggtaag 
gcttagtatt 
agccccaaat 



tgtacaagat 

agtctcaaaa 

ccaacactga 

tgtggcaact 

gtctgtgcat 

ccagatttac 

caacaggcag 

tatagaacac 

tgtgatggta 

cagtggaaga 

taactaatgc 

aaactactac 

aaagtagctc 

gacattacaa 

acttaacatg 

tagttggccc 

ccctttgctg 

tggaatattc 

tatttttaaa 

ctgcctttgc 

tccctgtgtg 

caagatagga 

ccaggtcgtg 

atgactccca 

AGATTGAAGA 

CTGGAGCAGC 

CCTTTATCGA 

ggggggggtt 

ataaggcgtt 

agggggcatg 

ggatggagca 

acagggtggc 

agcccttaac 

gggatttggc 

gtaagccgaa 

tggaaacaga 

cacatggaag 

cttgtttagt 

ccagtttgag 

gtttaactct 

gtggggtggg 

gtctgtaggt 

atgttctcca 

atctgacagg 

catgccaagt 

agcattagca 

tccaggcagg 

atctacactg 

tcataagata 

gatattgagg 

ggctgagcca 

ctctccacct 



acaaacttaa 
gggaaagtga 
tgagagaaat 
cagcctgtgc 
attcccagca 
agttgttcaa 
agaaggttag 
attctggcag 
aaacgttgaa 
tgggcagatt 
aatcttatat 
ttacaacaac 
ccatttaaga 
aacaatacgc 
ccgtaatttg 
caatggtgag 
tctagaggtg 
agaaggttcc 
gcaaggaaaa 
ctccactggg 
gtgtttgacg 
aggccattgg 
acaccactct 
agctcaacca 
TGAGCTGGAG 
TCGAGGCCCA 
GGCTTCAAAA 
cccacgtgag 
tcccatatgt 
agagggagat 
acctcaggag 
acatcctcaa 
ataatcatac 
cctgctgact 
tgtcggtgta 
cactcctctc 
ccaggttaca 
tgttacttta 
agctctcaga 
cagagtgcaa 
ctacagcgac 
agagctgagc 
aactgctgac 
tgctgggatg 
gcccagtttc 
gtgggcagcc 
cactgggctc 
cttggcatct 
cccttgggga 
atcagagact 
ggatttaaac 
cacagtgccc 



tttgatggat 

caatttcagg 

gcacacaatt 

ttgctgtggg 

gccacccatc 

tagatggatt 

atggcaattg 

ggcccaaagg 

aattccacgg 

ccttgcttta 

tacattatag 

tgtttttaga 

tattttccta 

tgtaaataca 

aactttctgc 

agtttactgt 

tttatcattc 

atgggaaaca 

tgttttcata 

aacatgtttc 

cagaggctga 

cagtgtgacc 

tttgacagcc 

ttccaatgcc 

ATGACCATGG 

GACCAACTTC 

ATgtaagacc 

gctacactct 

gaggctgtac 

cacctggcta 

acagtaaacg 

ggaattcacc 

cttccaacct 

tatgctaacc 

ttaatttacc 

ccttcgctgg 

atcaaaagtg 

aatacattaa 

tacaatcctg 

tcttattccc 

atgcacctgg 

tgggtcagtt 

ccacagggat 

cctcttcctt 

ctaccattag 

gagttggaga 

agagggaaca 

accatggact 

agttattgtt 

ggcttggcca 

ccaggttttt 

attctctgac 



tccttttgta 

gaaaatctgg 

taatacacct 

ttgccacagg 

tcatcactat 

tggtaatatc 

attcttgatt 

aaatcactca 

acttggacct 

attgacagac 

tccataaggg 

gcctttcaaa 

gtatttaagg 

ttcagatttt 

taaatcatgc 

ttctttaaat 

tgcttacttg 

atttgaatat 

tggatttatt 

tctccaacgc 

cagggcaggg 

ccagcccaca 

cagattgtta 

atctcctctg 

TTTGCCATCG 

AC CAAGAGGG 

cgtgcacgct 

ccccaatgcc 

aaggaaggcc 

gaaaggaagg 

gccaactgcc 

ctggcccagg 

ggtggtgccc 

aggctcaccg 

ttgggaaatg 

gacccgctct 

gagtcagagg 

tgtgttcctg 

gatattaatt 

aaatcctgga 

tcaccctccc 

ccaaactgac 

tccagcccct 

ccacactagc 

ggcttctttc 

aggatcctgt 

gcatccagaa 

caataccacc 

acccccattt 

aagtcacaaa 

ctgatcttaa 

aatgtctcat 
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cattttgcaa 

gaaatgcaca 

aggctgggcc 

actctctccc 

ACATTCAAGC 

ccttgaaatc 

cctccaagtc 

aaactcaggg 

gctttagcag 

agggctgata 

gtattataat 

tactcaagat 

agggtctgaa 

ttttgaatct 

gaacttcagt 

gcaggaaaca 

cccacaacag 

tgtccccatg 

gtgtcacatg 

catcttccct 

cttgggccct 

ttggtgtgtg 

catgcaggcc 

attgagttct 

aaaaaatttc 

aaagggaagc 

tgtagtacag 

tcatcccagg 

gtgccattca 

tttctcatga 

agatctaact 

ccataatttt 

gtcgaagccc 

cccatcacct 

ATTACCTCTT 

GAGgtacgct 

cctccccctc 

gctctcagtc 

agatggggaa 

ctcaggggac 

cagtggcagc 

caattcccac 

ttacagaaga 

tcaaagagca 

ccctcaatac 

acggccactg 

acagggctgg 

agggggcacc 

gaaggcagga 

aggacgcagg 

cacagcttca 

tccccttctc 



agcagctcca 

ggtccctgtc 

cagtcttctc 

cttcccacag 

AGATCTATGC 

tatcttgccc 

ctctcttcct 

caggacacct 

gaaagaactc 

atgcaacatc 

cccagatgga 

tgcacagctg 

acaggttttg 

tttccttaaa 

aaataccgaa 

taaagttgtg 

ctgctctgtt 

ggcctgggac 

ccactttgca 

ggatcctgac 

gtttctcatg 

gggcccctac 

ccagaattgc 

cactgtatgc 

aagagatagg 

tcagagaggt 

cggcatccaa 

cattcatggc 

tacaccaacg 

tggctagagg 

tcacacccaa 

ctcctttcca 

tgccttgttt 

gccctccttt 

CAATGCCTTC 

catctggggt 

taaatctcaa 

agaccaaaga 

aagcagcagc 

taataacaac 

ccctgtactt 

aataacccta 

ggagagtaag 

cataaatgat 

agactgttcc 

ggatgataga 

ctgtgttgat 

ttgcaaaaaa 

gctcttcaca 

aggcaccatt 

cactgtctga 

agGACTTTGT 



gtcctgagat 

cctggggatc 

ctagtgctgg 

GAGTGCCCCA 

TCAGTTTTTC 

agctccctct 

tgccatttgc 

ccctcatcgt 

tgaaaatgaa 

acaacgtctc 

gtaactgagt 

gtgagtagca 

ccatgctggc 

atagctgaga 

gactataaaa 

cttattaagc 

gccctgcccg 

cacatcggtg 

tcctgtcacc 

aaaatgcaaa 

ggcctttttg 

ccagatcagg 

agtttgctga 

caggcacagc 

tcttgctaca 

cgtgcttgat 

gttcaggtct 

cgttcagaag 

actccatgca 

gaggggcaag 

acccaaagag 

tcaccctagc 

ggaaggttct 

tctgccttgt 

GACACCACTC 

ccactctagg 

ggcattgggg 

catgtttatc 

aactttccct 

agtgtatgag 

agggctttgt 

tagggtaggg 

gcaaagagag 

aaaggatgga 

tacagtccac 

cagaccactg 

ctgtgtctct 

aaacaaaaaa 

gagcatagcc 

ctgtgagagt 

tggcttgatg 

AACCGCTCTG 



ggcactactt 
atgaggaacc 
ccctcaaatg 
GTGGTGTGGT 
CCTCATGGAG 
ctggtaagca 
ttccttctcg 
gagaggtggg 
c'ccaccctgg 
agaacagctc 
ttcggggaag 
aaccaggatt 
accaccattg 
caagctggaa 
ataaactaat 
cagtttacaa 
actcctgtgg 
actcctcctg 
aagagctgtc 
tccaagtatt 
gcagcagaca 
tgggggaggc 
gttccactga 
tgtagtttaa 
tcccattgta 
gcccacagtc 
ctctactctg 
aacattgtca 
tagacaggca 
ggctcatctc 
ttgagtcaat 
atcactctcc 
ccctgtgtgg 
agATGCCAGC 
AGACAGGCTC 
ggtcctctgg 
gaaggtctgg 
catttgtaag 
ggcctgcagg 
cttccgggca 
atgtattaat 
ttttattatt 
atagagtagt 
tttgaatgta 
gtcctcagcc 
cagccatgga 
cagagctcca 
atggggcagg 
acatcctcca 
atcacagtct 
gttaatgtca 
TCGATTTTAT 



gggagaagtg 
ccagacacca 
cctcccgctg 
CAACGAAGAC 
gtgagtctga 
gccttccctt 
aggaagagac 
agtctccaaa 
aaggggaaga 
tagaaagcag 
ataagcagtg 
agattccata 
tgcagggcac 
ttttgtaaaa 
tgaaaaagag 
gtgtgccagg 
gaaccagctg 
tggcctccat 
tcctgcaaga 
ccaaacactt 
gatgccttcc 
agttttgtcc 
gccactgttt 
aatacttttt 
gagaggtagg 
acatagctta 
gagtgagtgc 
tgcagtcact 
ggagaatggt 
actttttgct 
gggccccact 
tctctttctt 
aattcctgcc 
ACGTATGCCC 
CGTGAAGTTC 
ttctgcatca 
accatcaaaa 
catttcctaa 
aactcaggga 
cactgcttcc 
tcatttactc 
gattaccttt 
tttcccaagg 
ggcagaatga 
actagaccat 
taaggcaaaa 
ttcttcctca 
gtagggaact 
ggcagacaag 
gacccaaaga 
ctctgccttt 
TGAGAGGAAC 
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TGTCCACGAG 
ACGGATACAT 
ggctccagtg 
atagcactgt 
tccctggcaa 
cccaagcgcc 
gattcagtag 
gagttactct 
gtcaaagcta 
aaaagaaaga 
ttaaactatt 
cagctgtctt 
ttgcactttt 
caggtctatc 
gtggaagctg 
tgtgccaggc 
tcaacccggc 
attcctcatt 
attaggtgtt 
cccacactgg 
atcagcaggt 
aaggaggtgt 
gttccaggca 
cttcatgtgt 
atgaagctga 
atggtggagg 
ttagatgaac 
aagaatcaga 
acagttaaaa 
gtgaattcta 
tggaacattt 
aaccaaagaa 
cctcgtgagc 
tcagcaatat 
taggaaggca 
tatatttaaa 
tgtctttatt 
ctacaaataa 
tgagttctat 
agaaaattaa 
aaaaattcaa 
attgctgaaa 
catgggttgg 
tttatagatt 
cagttgatat 
agaacaaagt 
tgtgcagtag 
tcaatggatc 
tgatccaaca 
tatatcaaag 
atagccaaga 
aaagaaaatc 



AAACTAAGGT 
AAACAAAGAG 
aaggtaacta 
ctgaatgagg 
aataaagaag 
cagaaaggat 
atattacaca 
tccctaggga 
aaagttgaaa 
tttttaagga 
ttgcctatta 
tacagaaatt 
agaacctcag 
tcaaaactca 
gaaattggca 
acggtgcctg 
aagcttctct 
catttagtca 
ctaggtgctg 
agcagcatag 
agaagatatt 
cgtcatactg 
gaaggaacag 
ttcaaaaaca 
agactagtgc 
gc'tagggtaa 
atgttttttg 
aaatctgact 
accattccca 
atatcaatcc 
cccagcttgt 
aaaaaggtct 
acagataaaa 
ataaaaagga 
aggttggttt 
gaaaccaaag 
tgtacatggt 
actgctaaaa 
ttacaaaaat 
attaagaaaa 
caaaacaggt 
taaagtaaag 
aaggcccaat 
tagtgtgatc 
gggaatgcaa 
tgaaggacta 
tcaagatggt 
aaaatagaga 
atcccactgc 
agatatctgc 
tttggaagca 
tggtacttat 



GGACATTTAA 
gtaagtgagc 
acccaacaga 
caggctctgc 
gcctccctca 
taaacagatt 
gtggataaaa 
acctgcattt 
tggtggaatt 
agaggaacct 
atacaaccgc 
atcacaaata 
aggacatgtc 
ctttggtatc 
tcctgtaaca 
gaggtataga 
ctgtaagctc 
tgtttttaat 
gaggtaaaga 
agagatggac 
ttggcaggga 
gaaccagtta 
ccactagaga 
gaaaggcttc 
aggatgagcc 
ggggcccatc 
aaagacacaa 
agctctgtat 
aataaaatta 
tacaaaaaaa 
tttataagtc 
tatgaaagga 
aagctttcac 
tgatacaata 
aacattcaaa 
ggatatagac 
ataattgtat 
ataataagtg 
catttataat 
atagcatcaa 
gcaagactca 
aactaaataa 
atagttaaga 
tcaatcaaaa 
aaaacctaaa 
aacctacctt 
agggaattgg 
gtcaaaaaaa 
tggctatata 
actcccatgt 
acctaagtgt 
acacaatgga 



TTTGTATGAC 
tggggccagg 
aaacagcccc 
tttggggcta 
ttgccctacc 
cattctcact 
atgacttgtt 
gggaaggtta 
gtaggcagca 
acaattgggt 
caagggggta 
atttctgaat 
tctagccagt 
cactgtatcc 
ctccacttgc 
tatcagcacc 
agagagttta 
taacacagat 
ggtgaacaat 
aataaacagg 
aggttagcaa 
tacaaaaagc 
ggctttgagg 
agatggctgc 
tgacaggtca 
catattcccc 



aaggcaaaaa 
tatttagaga 
taggcccaga 
ttttagaaaa 
caatagtacc 
aacaactaca 
taaatattag 
caactatgtg 
aaccaatcaa 
tggaaaggaa 
taatagaaaa 
aatttaacaa 
tctacatcca 
aaagcacaaa 
tacactgaaa 
acacagagat 
tgtggattct 
tcccagcagg 
ataaccaaaa 
acttcaagac 
cataaggata 
ctaaaagtag 
ccccatgtaa 
tttttgcaga 
ccatcaacag 
gtacttctgc 



ATCAACAAGG 
ggtgtgagag 
aggcatgagg 
acagagctgg 
ctgccctgtt 
gggtcaccta 
tcagtgtgaa 
ggagccacaa 
cctagaatag 
catattggcc 
atggaaggta 
cttcactgct 
gaaataccct 
tggtatctca 
tgagctcctg 
aatcttcacc 
tccaactttc 
ttgaaggctt 
cagactaggc 
ccaaaaagga 
aggcctctcc 
caggaaaagt 
caagatagag 
agatgaaggt 
gcagagtcag 
tgggaagctc 
cagacttaaa 
aattatattc 
tggtggactg 
tagaggagga 
ctgataccca 
aaacagtctc 
caaattgaat 
agttttattc 
tataattcac 
gaattaacac 
ccctaagagt 
agtgaagaca 
acaagcaacc 
atactttgag 
actataaaat 
aaatcatgct 
ccaaatttga 
tattttgtag 
catctgaaaa 
ttaatgaaaa 
gacatataga 
agctaccata 
aggaaaccag 
tctgttcata 
atgaatggat 
cataaaatag 
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aatgagattt 
aagtgaaatg 
tttgttggat 
ggttaccaga 
atagtcaatg 
tggtagcaca 
ctaaaagagt 
gggatggata 
tatcaaaact 
acaaaaatta 
cacccatatg 
cgtgttttca 
ttaacctcat 
aatattaaaa 
ttacaacttt 
aatttgactt 
agaaaatgga 
ataccagata 
atccaaaata 
tcatataaaa 
tatgaatggc 
aaagaaatgc 
acaattaaaa 
agaaactctc 
ggaaacattt 
cccagaaatc 
gtactttggg 
accagactgg 
aaaaaaaaaa 
ctagggaggc 
cagtgagctg 
accctgtctc 
ggatatttac 
aagatgttca 
caaatgtctg 
caatggaata 
aatgtgtatg 
aaatgagttt 
aaactaatct 
gcttgagagg 
tgaaaatgtt 
tttgtcaaca 
agtaaattat 
aagggggaag 
aaaaaaatcc 
catccaaaat 
ccagtgctct 
ctgaggcagg 
acatagggag 
tgtagtggca 
ggatagcttg 
accacactcc 



aatcatctac 
aaccaggcac 
ctaaaaatca 
ggctgggaag 
ggtacaaaaa 
acaaggtgac 
gtaattggtt 
cccatttttc 
tgtcatgtac 
aaaataaaaa 
tcaatgcaac 
cagctgtata 
gccaaacata 
caaaagctgt 
ggagtaggaa 
tatcaaagtt 
aatgcaagcc 
tataaagata 
tataagggac 
taggcaaaag 
cagtaaaccc 
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taaaataaaa 
atgcaggaat 
ttaatatatc 
aatttaccat 
aaatacttct 
atactttatg 
gaaaagatga 
tgtcagacat 
tgaaacagaa 
tgttaatgac 
cttaaaacaa 
atacatatgc 
tagaaaattg 
gcacatgggc 
tttgataagt 
gatagcaata 
ttttgaatca 
ttataaagtt 
ttcaaaaaga 
attataaagc 
tagatcaatg 
cata^taaag 
tcttggaaaa 
ctcggtacca 
actaaaaaag 
aaagcataaa 
acactgtaaa 
gttgaaaagt 
acagaaagga 
agtcatattc 
acatggtaga 
atattttaac 
atccagtcag 
ttcatgggaa 
taagaaagta 
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ttttcataca 
tatttgagga 
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atacatataa 
ataaaataaa 
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aggaaaagca 
tttaaaaata 
tgagggcacc 
cctagacttt 
cttgaagaat 
cctggagtac 
actgcaggag 
ttgagcaaaa 
ggtaaatgcc 
aataaatgca 
ataagtatag 
tacagtaata 
aaggaaactt 
tcagtaataa 
agaaggttat 
tcaatctaat 
tacaaaacaa. 
caaggaaaca 
ccagcacttt 
gagaccagcc 
caggtgtggt 
gggaggatca 
gccactgcac 
aaaaaaaaga 
agactaagga 
aaaaatatta 
ccacagagcg 
aagaaaagct 
tgaagccatg 
gttgttaaca 
actaattttg 
aattaaacaa 
tgggtctctg 
attctgagtt 
ataaacacgg 
ggtttggagt 
tggaactatt 
tttttcttaa 
taggctagaa 
tcaagcctgt 
cccataacaa 
ctaccatctc 
. gcttctaact 
tttcattgcc 
gcttgacctc 
AGATGATGGA 
TATCCTGTGC 
CCAGgtaagt 
aagtcaaccc 
aagcatggtt 
acagatgatc 
agtatattat 



tcaatggcca 
aacacatgga 
tcagagcaat 
gatttctaaa 
agttgattct 
tttattatgc 
tcaacctgaa 
ttaaaaatta 
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aaggaataaa 
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TCAAAGAGGA 
gcacacaccc 
acgagcatct 
ggtaacagaa 
tccttaagct 
taaacgctag 



tgtagatgtg 
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taccattctt 
aagattggga 
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gtcaaacatc 
ggtcagaata 
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actgatcttt 
gccctgtgga 
tggcagaaac 
ttaaaaattc 
cccacaaatc 
ccatgagctt 
agggctccca 
tgctaaagaa 
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cactgtgtgc 
GCCATCTATG 
CACTCCAAGG 
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gcaagtgtac 
tggtatagta 
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atttggggat 

ggtgttaagc 

aggatagggc 

ctggtacagg 

aatcattggg 

gaatatcaag 

agagagtcct 

gaatcagaac 

ggagtctaca 

agagcaagtc 

tcaggaccta 

agggcaaagt 

ctgagctcca 

agattctgta 

AAAGATGGCA 

aaggagagat 

gggcctgggg 

caacccctcc 

agttccactc 

tgaacagaga 

ggtctctgtg 

cctgcctctg 

ggggaaaaaa 

aattgcct.gg 

agactctaga 

taacacttta 

gattctaatt 

cattctcccc 

ttaagtctaa 

aaataattgc 

ttttagaaat 

atgtaagaaa 

gtttcaagca 

tatgtgcagg 

ctgaccccct 

tgtaaaacag 

ccatgggatc 

aagcagcctg 
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ttctgatatt 

tattgatttt 

ctctattgca 

gtatagtatt 

cacttctaat 

taaaaatatg 
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gacctttaaa 

agtgctgatc 
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gtgcaagttt 

cagtgttctg 

aataagccag 

aggaagctca 
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aggggagtaa 
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aataactcta 

ggaaggtgat 
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cctgcagttg 

gggtctgggg 
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tttcccagGA CGACAACATC ATGAGGTCTC TCCAGCTGTT TCAAAATGTC 
ATGTAActgg tgacactcag ccattcagct . ctcagagaca ttgtactaaa 
caaccacctt aacaccctga tctgcccttg ttctgatttt acacaccaac 
tcttgg 
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Amino acid sequence of KChlPl 



MGAVMGTFSSLQTKQRJRPSKDKJEDELEMTMVCHRPEGLEQLE 
AQTNFTKRELQVLYRGFKNECPS GWNEDTFKQIYAQFFPHGD ASTY A 
HYLFNAFDTTQTGSVKFEDFVTALSILLRGTVHEKLRWTFNLYDINKr) 
GYINKEEMMDIVKAIYDMMGKYTYPVLKEDTPRQHVDV^ 
KDGIVTLDEFLESCQEDDNIMRSLQLFQNVM (SEQ ID NO: 2) 
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Nucleic acid sequence encoding the amino acid sequence of KChlPl 



ATGGGGGCCGTCATGGGCACCTTCTCATCTCTGCAAACCAAA 
CAAAGGCGACCCTCGAAAGATAAGATTGAAGATGAGCTGGA 
GATGACCATGGTTTGCCATCGGCCCGAGGGACTGGAGCAGC 
TCGAGGCCCAGACCAACTTCACGAAGAGGGAGCTGCAGGTC 
CTTTATCGAGGCTTCAAAAATGAGTGCCCCAGTGGTGTGGTC 
AACGAAGACACATTCAAGCAGATCTATGCTCAGTTTTTCCCT 
. CATGGAGATGCCAGCACGTATGCCCATTACCTCTTCAATGCC 
TTCGACACCACTCAGACAGGCTCCGTGAAGTTCGAGGACTTT 
GTAACCGCTCTGTCGATTTTATTGAGAGGAACTGTCCACGAG 
AAACTAAGGTGGACATTTAATTTGTATGACATCAACAAGGA 
CGGATACATAAACAAAGAGGAGATGATGGACATTGTCAAAG 
CCATCTATGACATGATGGGGAAATACACATATCCTGTGCTCA 
AAGAGGACACTCCAAGGCAGCATGTGGACGTCTTCTTCCAG 
AAAATGGACAAAAATAAAGATGGCATCGTAACTTTAGATGA 
ATTTCTTGAATCATGTCAGGAGGACGACAACATCATGAGGT 
CTCTCCAGCTGTTTCAAAATGTCATG 
(SEQ ID NO:3) 



FIG. 3 
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